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! Abstract 

I— I. An aperiodic tile set was first constructed by R. Berger while proving the undecidability of the 

domino problem. It turned out that aperiodic tile sets appear in many fields, ranging from logic (the 

. Entscheidungsproblem) to physics (quasicrystals). 
^ ' We present a new construction of an aperiodic tile set that is based on Kleene's fixed-point construc- 

O . tion instead of geometric arguments. This construction is similar to J. von Neumann's self-reproducing 
automata; similar ideas were also used by R Gacs in the context of error-correcting computations. 
\^ . This construction is rather flexible, so it can be used in many ways. We show how it can be used to 

^ ' implement substitution rules, to construct strongly aperiodic tile sets (in which any tiling is far from any 

. periodic tiling), to give a new proof for the undecidability of the domino problem and related results, to 

' characterize effectively closed one-dimensional subshifts in terms of two-dimensional subshifts of finite 

. type (an improvement of a result by M. Hochman), to construct a tile set that has only complex tilings, 

- , • ' and to construct a "robust" aperiodic tile set that does not have periodic (or close to periodic) tilings even 

^"T . if we allow some (sparse enough) tiling errors. For the latter, we develop a hierarchical classification of 

Q>^ ' points in random sets into islands of different ranks. Finally, we combine and modify our tools to prove 

. our main result: There exists a tile set such that all tilings have high Kolmogorov complexity even if 

sjjl ' (sparse enough) tiling errors are allowed. 
• ^ . Some of these results were included in the DLT extended abstract LIOJ and in the ICALP extended 

^ ■ abstract llTTl . 
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1 Introduction 



In this paper, tiles are unit squares with colored sides. Tiles are considered as prototypes: we may place 
translated copies of the same tile into different cells of a cell paper (rotations are not allowed). Tiles in the 
neighbor cells should match (i.e., the common sides should each have the same color). 

Formally speaking, we consider a finite set C of colors. A tile is a quadruple of colors (left, right, top, 
and bottom ones), i.e., an element of C^. A tile set is a subset T CC^. A tiling of the plane with tiles from T 
(z-tiling) is a mapping U : I? ^ T that respects the color-matching condition. 

A tiling U is periodic if it has a period, i.e., a nonzero vector T ^1? such that ?7(;c + T) = U ix) for all 
X G 1?. Otherwise, the tiling is aperiodic. The following classical result was proved in 13]: 

Theorem 1. There exists a tile set T such that T -tilings exist and all of them are aperiodic. 

The construction from the proof of Theorem[T]was used in 131 as the main tool to prove Berger's theorem: 
The domino problem (to find out whether or not a given tile set has tilings) is undecidable. 

The first tile set of Berger was rather complicated. Later, many other constructions were suggested. 
Some of them are simplified versions of Berger's construction ([29 1; see also the expositions in [1. 8. , 22.1). 
Some others are based on polygonal tilings (including the famous Penrose and Ammann tilings; see |[T5l ). 
An ingenious construction suggested in ||l9l is based on multiplication in a kind of positional number system 
and gives a small aperiodic set of 14 tiles (and in [6 | an improved version with 13 tiles is presented). Another 
nice construction with a short and simple proof (based explicitly on ideas of self-similarity) was recently 
proposed in ll27l . 

In this paper, we present yet another construction of an aperiodic tile set. It does not provide a small tile 
set; however, we find it interesting for the following reasons: 

• The existence of an aperiodic tile set becomes a simple application of the classical construction used 
in Kleene's fixed-point (recursion) theorem, in von Neumann's self -reproducing automata f26l, and, 
more recently, in Gacs' reliable cellular automata | IT2llT3l ; we do not use any geometric tricks. An 
aperiodic tile set is not only an interesting result but an important tool (e.g., this construction was 
invented to prove that the domino problem is undecidable); our construction makes this tool easier to 
use. 

• The construction is rather general, so it is flexible enough to achieve some additional properties of the 
tile set. We illustrate this flexibility by providing new proofs for several known results and proving 
new results; these new results add robustness (resistance to sparse enough errors) to known results 
about aperiodic tile sets and tile sets that have only complex tilings. 

It is unclear whether this kind of robustness can be achieved for previously known constructions of tile 
sets. On the other hand, robustness properties appear to be important. For example, mathematical models 
for processes such as quasicrystal growth or DNA computation should take errors into account. Note that 
our model (with its independent choice of places where errors are allowed) has no direct physical meaning; 
it is just a simple mathematical model that can be used as a playground to develop tools for estimating the 
consequences of tiling errors. 

The paper is organized as follows: 

• In Section |2l we present the fixed-point construction of an aperiodic tile set (new proof of Berger's 
theorem), and we illustrate the flexibility of this construction by several examples. 
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• In Section [3l we show that any "uniform" substitution rule can be implemented by a tile set (thus 
providing a new proof for this rather old result). 

• In Section |4j we use substitutions to show that there are strongly aperiodic tile sets (which means that 
any tiling is strongly aperiodic, i.e., any shift changes at least some fixed fraction of tiles). 

• The fixed-point construction of Section|2]provides a self-similar tiling: Blocks of size nxn ("macro- 
tiles") behave exactly as individual tiles, so on the next level we have x blocks made of nxn 
macro-tiles that have the same behavior, etc. In Section[5l we make some changes in our construction 
that allow us to get variable zoom factors (the numbers of tiles in macro-tiles increase as the level 
increases). 

Variable zoom factor tilings can be used for simulating computations (with higher levels performing 
more computation steps); we use them to give a simple proof of the undecidability of the domino 
problem. The main technical difficulty in the standard proof was to synchronize computations on 
different levels. In our construction this is not needed. We show also that other undecidabihty results 
can be obtained in this way. 

• This technique can be used to push the strong aperiodicity to its limits: The distance between every 
tiling and every periodic configuration (or between every tiUng and its nontrivial shift) can be made 
arbitrarily close to 1, not only separated from 0. This is done in Section [6] using an additional tool: 
error-correcting codes. 

• In fT|, a tile set was constructed such that every tiling has maximal Kolmogorov complexity of frag- 
ments iQ.{n) for nxn squares); all tilings for this tile set are noncomputable (thereby implying a 
classical result of Hanf 1 17] and Myers |[25]| as a corollary). The construction in [7] was rather com- 
plicated and was based on a classical construction of an aperiodic tile set. In Section |7J we provide 
another proof of the same result that uses variable zoom factors. It is simpler in some respects and can 
be generalized to produce robust tile sets with complex tiling, which is our main result (Section [T3]). 

• In Section m we use the same technique to give a new proof for some results by Simpson |32] and 
Hochman |18| about effectively closed subshifts: Every one-dimensional effectively closed subshift 
can be obtained as a projection of some two-dimensional subshift of finite type (in an extended alpha- 
bet). Our construction provides a solution of Problem 9.1 from ifTSl . (Another solution, based on the 
classical Robinson-type construction, was independently suggested by Aubrun and Sablik; see 13.) 

• To prove the robustness of tile sets against sparse errors we use a hierarchical classification of the 
elements of random sets into islands of different levels (a method that goes back to Gacs |lT,T4]). This 
method is described in Section [9?T] In Section [9!2l we give definitions and establish some probabilistic 
results about islands that are used later to prove robustness. We show that a sparse random set on 1? 
with probability 1 (for Bernoulli distribution) can be represented as a union of "islands" of different 
ranks. The higher the rank, the bigger is the size of an island; the islands are well isolated from 
each other (i.e., in some neighborhood of an island of rank k, there are no other islands of rank > k). 
Then, in Section 1931 we illustrate these tools using standard results of percolation theory as a model 
example. In Section [9!4l we modify the definition of an island by allowing two (but not three!) islands 
of the same rank to be close to each other. This more complicated definition is necessary to obtain the 
most technically involved result of the paper in Section[T3]but can be skipped if the reader is interested 
in the other results. 
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• In Section \T0\ we use a fixed-point construction to get an aperiodic tile set that is robust in the fol- 
lowing sense: If a tiling has a "hole" of size n, then this hole can be patched by changing only an 
C?(?i)-size zone around it. Moreover, we do not need for this a tiling of the entire plane. An 0{n) zone 
(with bigger constant in O notation) around the hole is enough. 

• In Section [m we explain how to get robust aperiodic tile sets with variable zoom factors. Again, this 
material is used in Section [T3] only. 

• In Section [T2I we combine the developed techniques to establish one of our main results: There exists 
a tile set such that every tiling of the plane minus a sparse set of random points is far from every 
periodic tiling. 

• Finally, Section [13] contains our most technically difficult result: a robust tile set such that all tilings, 
even with sparsely placed holes, have linear complexity of fragments. To this end we need to combine 
all our techniques: fixed-point construction with variable zoom factors, splitting of a random set into 
doubled islands (we shall call them bi-islands), and "robustification" with filling of holes. 

2 Fixed-point aperiodic tile set 
2.1 Macro-tiles and simulation 

Fix a tile set T and an integer N > I {zoom factor). A macro-tile isanN xN square tiled by T-tiles matching 
each other (i.e., a square block of N'^ tiles with no color conflicts inside). We can consider macro-tiles as 
"preassembled" blocks of tiles; instead of tiling the plane with individual tiles, we may use macro-tiles. To 
get a correct T-tiling in this way, we need only to ensure that neighbor macro-tiles have matching macro- 
colors, so there are no color mismatches on the borders between macro-tiles. More formally, by macro-color 
we mean a sequence of colors on the side of a macro-tile (i.e., the right macro-color is a sequence of the 
right colors of the tiles on the right edge of a macro-tile, and the same for the left, the top, and the bottom 
macro-color). Each macro-tile has four macro-colors (one for each side). We always assume that macro-tiles 
are placed side to side, so the plane is split into N xN squares by vertical and horizontal lines. 

In the following we are interested in the situation when T-tilings can be split uniquely into macro-tiles 
that behave like tiles from some other tile set p. Formally, let us define the notion of a simulation. 

Let T and p be two tile sets, and let A/^ > 1 be an integer. By simulation of p by z with zoom factor N we 
mean a mapping S of p-tiles into N xN T-macro-tiles such that the following properties hold: 

• 5 is injective (i.e., different tiles are mapped into different macro-tiles). 

• Two tiles ri and r2 match if and only if their images S{ri) and 5(r2) match. This means that the 
right color of ri equals the left color of r2 if and only if the right macro-color of S{ri ) equals the left 
macro-color of S{r2), and the same is true in the vertical direction. 

• Every T-tiling can be split by vertical and horizontal lines into N xN macro-tiles that belong to the 
range of S, and such a splitting in unique. 

The second condition guarantees that every p -tiling can be transformed into a T-tiling by replacing each 
tile r G p by its image, macro-tile S{r). Taking into account other conditions, we conclude that every T- 
tiling can be obtained in this way, and the positions of grid lines as well as the corresponding p-tiles can be 
reconstructed uniquely. 
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Example 1 (negative). Assume that T consists of one tile with four white sides. Fix some A'^ > 1. 
There exists a single macro-tile of size N xN. Does this mean that T simulates itself (when its only tile is 
mapped to the only macro-tile)? No. The first and second conditions are true, but the third one is false: The 
placement of cutting hnes is not unique. 
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Figure 1 : Tiles and macro-tiles for Example 2. 

Example 2 (positive). In this example p consists of one tile with all white sides. The tile set T consists of 
tiles indexed by pairs of integers modulo N. A tile from T has colors on its sides as shown on Fig.[T] 
(each color is a pair of integers modulo A'^, so set C of all colors consists of N'^ elements). The simulation 
maps the single p-tile to a macro-tile that has colors (0,0), . . . , (0,A'^ — 1) and (0,0), . . . , (A'^ — 1,0) on its 
vertical and horizontal borders, respectively (see Fig. [Hi. 

Definition. A self-similar tile set is a tile set that simulates itself. 

The idea of self-similarity is used (more or less explicitly) in most constructions of aperiodic tile sets 
(but ||19, 6j| are exceptions). However, not all of these constructions provide literally self-similar tile sets in 
our sense. 

It is easy to see that self-similarity guarantees aperiodicity. 
Proposition 1. A self-similar tile set z may have only aperiodic tilings. 

Proof. Let 5 be a simulation of T by itself with zoom factor N. By definition, every T-tiling U can be 
uniquely split into N xN macro-tiles from the range of S. So every period T of ?7 is a multiple of A'^ (since 
the T-shift of a cut is also a cut, the shift should respect borders between macro-tiles). Replacing each 
macro-tile by its S'-preimage, we get a T-tiling that has period T /N. Therefore, T /N is again a multiple of 
A'^. Iterating this argument, we conclude that T is divisible by A'^*^ for every k, so T. □ 

Note also that every self-similar tile set has arbitrarily large finite tilings. Starting with some tile, we 
apply S iteratively and get a big tiled square. The standard compactness argument guarantees the existence 
of a tiling of the entire plane. Therefore, to prove the existence of aperiodic tile sets it is enough to construct 
a self-similar tile set. 

Theorem 2. There exists a self-similar tile set t. 

Theorem [2] was explicitly formulated and proven by OUinger |[27l ; in his proof a self-similar tile set 
(consisting of 104 tiles) is constructed explicitly. This tile set is then used to implement substitution rules 
(cf. Theorem [3] below). Another example of a self-similar tile set (with many more tiles) is given in [Sj. 
(Note that the definition of self-similarity used in [SJ is a bit stronger.) 

We prefer a less specific and more flexible argument based on the fixed-point idea. Our proof works for 
a vast class of tile sets (though we cannot provide explicitly an aperiodic tile set of a reasonably small size). 
The rest of this section is devoted to our proof of Theorem |2l Before we prove this result, we explain a few 
techniques used in our construction and show how to simulate a given tile set by embedding computations. 
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2.2 Simulating a tile set 



Let us start with some informal discussion. Assume that we have a tile set p whose colors are ^-bit strings 
(C = {0, 1}*) and the set of tiles p C is presented as a predicate R{ci , C2, C3 , C4) with four ^-bit arguments. 
Assume that we have some Turing machine ^ that computes R. Let us show how to simulate p using some 
other tile set T. 

This construction extends Example 2, but it simulates a tile set p that contains not a single tile but many 
tiles. We keep the coordinate system modulo A'^ embedded into tiles of T; these coordinates guarantee that all 
T-tilings can be uniquely split into blocks of size N xN and every tile "knows" its position in the block (as 
in Example 2). In addition to the coordinate system, now each tile in T carries supplementary colors (from a 
finite set specified below) on its sides. These colors form a new "layer" which is superimposed with the old 
one; i.e., the set of colors is now a Cartesian product of the old one and the set of colors used in this layer. 

On the border of a macro-tile (i.e., when one of the coordinates is zero) only two supplementary colors 
(say, and 1) are allowed. So the macro-color encodes a string of N bits (where is the size of macro-tiles). 
We assume that is much bigger than k and let k bits in the middle of macro-tile sides represent colors from 
C. All other bits on the sides are zeros. (This is a restriction on tiles: Each tile "knows" its coordinates so it 
also knows whether nonzero supplementary colors are allowed.) 

Now we need additional restrictions on tiles in T that guarantee that macro-colors on the sides of each 
macro-tile satisfy relation R. To achieve this, we ensure that bits from the macro-tile sides are transferred to 
the central part of the tile where the checking computation of ^ is simulated (Fig.|2ll. 




Figure 2: Wires and processing zones; wires appear quite narrow since N ^ k. 

For that we need to fix which tiles in a macro-tile form "wires" (this can be done in any reasonable way; 
we assume that wires do not cross each other) and then require that each of these tiles carries equal bits on 
two sides (so some bit propagates along the entire wire); again this is easy to arrange since each tile knows 
its coordinates. 

Then, we check /? by a local rule that guarantees that the central part of a macro-tile represents a time- 
space diagram of ,^'s computation (with the tape being horizontal, and time increasing upward). This is done 
in a standard way: The time-space diagram (tableau) of a Turing machine computation can be described by 
local rules, and these rules can be embedded into a tile se{3 (see details in, e.g., lUllTSl). We require that 
computation terminates in an accepting state; if not, the tiling cannot be formed. 

'Speaking about local rules, we mean that one can check the correctness of the time-space diagram by looking through a 0(1)- 
size window; in the standard representation width 3 and height 2 is enough. However, our definition of a tile set is even more local: 
We compare colors on matching sides only. It is easy to see that we can still simulate any local rules by tiles. Each tile keeps the 
contents of the corresponding window, and colors are used to ensure that overlapping windows are consistent. 
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To make this construction work, the size of the macro-tile (N) should be large enough; we need enough 
space for k bits to propagate and enough time and space (= height and width) for all accepting computations 
of ^ to terminate. 

In this construction the number of supplementary colors depends on the machine ^ (the more states 
it has, the more colors are needed in the computation zone). To avoid this dependency, we replace ^ by 
a fixed universal Turing machine that runs a program simulating Let us agree that the tape of the 
universal Turing machine has an additional read-only layer. Each cell carries a bit that is not changed during 

I I I 



Universal 




Figure 3: Checking tiles with a universal Turing machine. 

the computation; these bits are used as a program for the universal machine . (We may assume that the 
program bits occupy some part of the reserved read-only layer, e.g., the leftmost bits on this layer; see 
Fig. [3l) In terms of our simulation, the columns of the computation zone carry unchanged bits (considered 
as a program for U), and the tile set restrictions guarantee that the central zone represents the record (time- 
space diagram) of an accepting computation of (with this program). In this way, we get a tile set T that 
simulates p with zoom factor using 0{N^) tiles. (Again we need N to be large enough, but the constant 
in 0{N^) does not depend on N.) 

2.3 Simulating itself 

We know how to simulate a given tile set p (represented as a program for the universal Turing machine) by 
another tile set T with a large enough zoom factor N. Now we want T to be identical to p in which case 
Proposition [T] guarantees aperiodicity). For this we use a construction that follows the proof of Kleene's 
recursion (fixed-point) theorem. 

We cannot refer here to the statement of the theorem; we need to recall its proof and adapt it to our 
framework. Kleene's theorem |[20l says that for every computable transformation 7i of programs one can 
find a program p such that p and 7l{p) are equivalent, i.e., produce the same output. (For simplicity we 
consider programs with no input, but this restriction does not really matter.) In other words, there is no 
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guaranteed way to transform a given program p into some other program n{p) that produces different 
output. As a sketch of the proof, first we note that the statement is language-independent since we may use 
translations in both directions before and after n. Therefore, without loss of generality, we may assume that 
the programming language has some special properties. First, we assume that it has a function GetText () 
that returns the text of the program (or a pointer to a memory address where the program text is kept). 
Second, we assume that the language contains an interpreter function Execute (string s) that interprets 
the content of its string argument s as a program written in the same language. It is not difficult to develop 
such a language and write an interpreter for it. Indeed, the interpreter can access the program text anyway, 
so it can copy the text into some string variable. The interpreter also can recursively call itself with another 
program as an argument when it sees the Execute call. If our language has these properties, it is easy to 
construct the fixed point for n: Just take the program Execute (7i:(GetText () ) ) . 

This theorem shows that a kind of self-reference, in which we write the program as if its full text is 
already given to us, is still acceptable. A classical example is a program that prints its own text. The proof 
shows a way how to do this by using a computation model where the immutable text of the program is 
accessible to it. 

Constructing a self-similar tiling, we have the same kind of problems. We have already seen how to 
construct a tile set T that simulates a given tile set p. [Counterpart: It is easy to write a program that prints 
any given text.] What we need is to construct a tile set that simulates itself. [Counterpart: What we need is 
to write a program that prints its own text.] 

Let us look again at our construction that transforms the description of p (a Turing machine that com- 
putes the corresponding predicate) into a tile set T that simulates p. Note that most rules of T do not depend 
on the program for dealing with information transfer along the wires, the vertical propagation of un- 
changed program bits, and the space-time diagram for the universal Turing machine in the computation 
zone. Making these rules a part of p's definition (by letting k = 2\ogN + 0{\) and encoding 0{N^) colors 
by 21ogA^-|- 0(1) bits), we get a program that checks that macro-tiles behave like T-tiles in this respect. 
Macro-tiles of the second level ("macro-macro-tiles") made of them would have the correct structure, wires 
that transmit bits to the computation zone, and even the record of some computation in this zone, but this 
computation could have an arbitrary program. Therefore, at the third level all the structure is lost. 

What do we need to add to our construction to close the circle and get self-simulation? The only 
remaining part of the rules for T (not implemented yet at the level of macro-tiles) is the hard-wired program. 
We need to ensure that macro-tiles carry the same program as T-tiles do. For that our program (for the 
universal Turing machine) needs to access the bits of its own text. As we have discussed, this self-referential 
action is in fact quite legal: The program is written on the tape, and the machine can read it. The program 
checks that if a macro-tile belongs to the first line of the computation zone, this macro-tile carries the correct 
bit of the program. 

How should we choose (hard- wired in the program)? We need it to be large enough so the computation 
described above (which deals with O(logA^) bits) can fit in the computation zone. Note that the computation 
never deals with the list of tiles in T or a truth table of the corresponding 4-ary relation on bit strings; all 
these objects are represented by programs that describe them. The computation needs to check simple things 
only: that numbers in the 0, ... — I range on four sides are consistent with each other, that rules for wires 
and computation time-space diagram are observed, that program bits on the next level coincide with actual 
program bits, etc. All these computations are rather simple. They are polynomial in the input size, which is 
O(logAf)), so for large they easily fit in Q.{N) available time and space. 

This finishes the construction of a self-similar aperiodic tile set. 

Remark. Let us also make a remark that will be useful later. We defined a tile set as a subset of C^, 
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where C is a set of colors. Using this definition, we do not allow different tiles to have the same colors 
on their sides. The only information carried by the tile is kept on its sides. However, sometimes a more 
general definition is preferable. We can define a tile set as a finite set T together with a mapping of T into 
C^. Elements of T are tiles, and the mapping tells us for each tile which colors it has on its four sides. 

One can easily extend the notions of macro-tiles and simulation to this case. In fact, macro-tiles are 
well suited to this definition since they already may carry information that is not reflected in the side macro- 
colors. The construction of a self-similar tile set also can be adapted. For example, we can construct a 
self-similar tile set where each tile carries an auxiliary bit, i.e., exists in two copies having the same side 
colors. Since the tile set is self-similar, every macro-tile at every level of the hierarchy also carries one 
auxiliary bit, and the bits at different levels and in different macro-tiles are unrelated to each other. Note that 
the total density of information contained in a tiling is still finite, since the density of information contained 
in auxiliary bits assigned to high-level macro-tiles decreases with level as a geometric sequence. 

3 Implementing substitution rules 

The construction of a self-similar tiling is rather flexible and can be easily augmented to get a self-similar 
tiling with additional properties. Our first illustration is the simulation of substitution rules. 

Let A be some finite alphabet and m > 1 be an integer. A substitution rule is a mapping s: ^mxm 
This mapping can be naturally extended to A-configurations. By A-configuration we mean an integer lattice 
filled with A-letters, i.e., a mapping 1? ^ A considered modulo translations. A substitution rule s applied 
to a configuration X produces another configuration s{X) where each letter a e A is replaced by an m x m 
matrix s{a). 

We say that a configuration X is compatible with substitution rule s if there exists an infinite sequence 

where Xt are some configurations. This definition was proposed in ll27l . The classical definition (used, in 
particular, in EH) is slightly different: Configuration X -.I? ^Ais said to be compatible with a substitution 
rule s if every finite part of X occurs inside of some s^"'' (a) (for some n gN and some a £ A). We prefer 
the first approach since it looks more natural in the context of tilings. However, all our results can be 
reformulated and proven (with some technical efforts) for the other version of the definition; we do not go 
into details here. 

Example 3. Let A = {0, 1}, 

.(0) = (?!), .(1) = (?>,). 

It is easy to see that the only configuration compatible with s is the chess-board coloring where zeros and 
ones alternate horizontally and vertically. 
Example 4 (Fig. S]). Let A = {0, 1}, 

.(0) = («i), .(i) = (J?). 

One can check that all configurations that are compatible with this substitution rule (called Thue-Morse 
configurations in the following) are aperiodic. (In Section |4] we will prove a stronger version of this fact.) 
One may note, for example, that every configuration compatible with this substitution rule can be uniquely 
decomposed into disjoint 2x2 blocks ( j q) and (q ^) by vertical and horizontal lines; since neighbor cells 
of the same color should be separated by one of those lines, the position of the lines is unique. Then, we can 
apply the argument from Proposition [T] (with N = 2). 
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Figure 4: Three steps of Thue-Morse substitution. 



The following theorem goes back to Mozes 1241 . It says that every substitution rule can be enforced by 
a tile set. 

Theorem 3. Let A be an alphabet and let s be a substitution rule over A. Then, there exist a tile set T and a 
mapping e : T — )• A such that 

(a) the e-image of any T -tiling is an A-configuration compatible with s; 

(b) every A-configuration compatible with s can be obtained in this way. 

A nice proof of this result for 2 x 2 substitutions is given in |[27l . where an explicit construction of a tile 
set T for every substitution rule s is provided. We prove this theorem using our fixed-point argument. In this 
way we avoid the boring technical details; but the tile sets that can be extracted from our proof contain a 
huge number of tiles. 

Proof. . Let us modify the construction of the tile set T (with zoom factor N) by taking s into account. First 
consider a very special case when 

• the substitution rule maps each A-letter into anN x N matrix (i.e., m = N) and 

• the substitution rule is easy to compute: Given a letter u € A and (/, j), we can compute the (/, j)-th 
letter of s{u) in a time much less than N. 

In this case we proceed as follows. In our basic construction every tile knows its coordinates in the 
macro-tile and some additional information needed to arrange "wires" and simulate calculations of the 
universal Turing machineH Now, in addition to this basic structure, each tile keeps two letters of A. The first 
is the label of a tile itself, and the second is the label of the N x N macro-tile it belongs to. This means that 
we keep additional 21og \A\ bits in each tile, i.e., multiply the number of tiles by |Ap. It remains to explain 
how the local rules work. We add two requirements: 

(i) The second letter is the same for neighbor tiles {unless they are separated by a border of some N xN 
macro-tile). This constraint can be easily enforced by colors on sides of tiles. We multiply the number 

■^We use this anthropomorphic terminology in the hope it makes the proof more intuitive. By saying "each tile knows its 
coordinates," we mean that the tile set is split into A'^ disjoint groups; each group corresponds to tiles that appear in one of 
positions in the macro-tiles. The correct positioning of the tiles is ensured (as we have seen) by side colors. The self-similarity 
guarantees that the same is true for macro-tiles, where the group (i.e., the coordinates in a macro-tile of the next level) is determined 
by the content of the computation zone and corresponding bits (macro-colors) on the sides. 
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of colors in our basic construction by |A|; now each color of the new construction is a pair: its first 
component is a color from the basic construction and its second component is a letter of A. The 
second component of the new color guarantees that every two neighbor tiles keep the same "father" 
letter (unless these tiles are separated by a border and do not belong to the same father macro-tile, in 
which case we do not exhibit the letter to those borders). 

(ii) The first letter in a tile is determined by the second letter and the coordinates of the tile inside the 
macro-tile, according to the substitution rule. Indeed, each tile "knows" its coordinates in a macro- 
tile. Therefore, its first letter must appear in ^(second letter) at the corresponding position. We do not 
need to extend the set of colors to enforce this property. This requirement is only a restriction on tiles. 
It explains which combinations 

(coordinates in the father macro-tile, first letter, second letter) 

can be combined in one tile of our tile setJl 

We want the new tile set to be self-similar. Therefore, we should guarantee that the requirements (i) 
and (ii) hold also for macro-tiles. Fortunately, both requirements are easy to integrate in our basic self- 
referential construction. In each macro-tile, two letters of A are encoded by strings of bits in some specially 
reserved locations on the tape of the Turing machine (simulated in the computation zone of this macro-tile). 
Requirement (i) is enforced by adding extra log |A| bits to macro-colors; to achieve (ii), a macro-tile should 
check that its first letter appears in ^'(second letter) at the required position. This is possible when s is easy 
to compute. (Knowing the coordinates and the second letter, the program computes the required value of 
the first letter and then compares it with the actual value.) 

Requirements (i) and (ii) ensure that if we take first letters from A assigned to each tile, we get an A- 
configuration that is an 5-image of some other configuration. Also (because of self-similarity) we have the 
same property on the level of macro-tiles. But this is not enough. We need to guarantee that the first letter on 
the level of macro-tiles is identical to the second letter on the level of tiles. This is also achievable. The first 
letter of a macro-tile is encoded by bits in its computation zone, and we can require that those bits match the 
second letter of the tiles at that place. (Recall that the second letter is the same across the tiles that constitute 
one macro-tile; note also that each tile "knows" its coordinates and can determine whether it is in the zone 
for the first letter in the macro-tile and which bit should be there.) By self-similarity, the same arguments 
work for macro-tiles of all levels. It is easy to see that now the tile set T has the required properties (each 
tiling projects into a configuration compatible with s and vice versa). 

However, this construction assumes that (the zoom factor) is equal to the matrix size in the substitution 
rule, which is usually not the case. In fact, usually the value of m (a parameter of the substitution rule) is 
fixed in advance, and we have to choose N, which needs to be large enough. To overcome this difficulty, we 
let be equal to m*^ for some k, and we use the substitution rule i.e., the kth iteration of s (a configuration 
is compatible with s'^ if and only if it is compatible with s). Now we do not need s to be easily computable: 
For every s, if k is large enough, the computation of / will fit into the available space (exponential in k). □ 

natural question arises: What does it mean to add a letter that is determined by other information? Adding a letter means 
that we create \A\ copies of the same tile (with different letters); but then, the restriction prohibits all of them except one, so is there 
any change at all? In fact, the actual change is occurring on higher levels: We want the macro-tiles to have both letters written on 
the tape as binary strings (in some prearranged places). This is important for checking consistency between levels. 
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4 The Thue-Morse lemma and strongly aperiodic tile sets 



Let a > be a real number. We say that a configuration U : ^ A is a-aperiodic if for every nonzero 
vector r G there exists A'^ such that in every square whose side is at least A'^ the fraction of points x such 
that U{x) ^U{x + T) exceeds a. 

Remark. If U is a-aperiodic, then the Besicovitch distance between U and any periodic pattern is at 
least a/2. (The Besicovitch distance between two configurations is defined as limsup^t/ft?, where dfj is the 
fraction of points where two configurations differ in the N x N centered square. It is easy to see that the 
distance does not depend on the choice of the center point.) 

Theorem 4. There exists a tile set 1 such that l-tilings exist and every i-tiling is a-aperiodic for every 
a < 1 /4. 

Proof. The proof is obtained by applying Theorem [3] to the Thue-Morse substitution rule T (Example 4). 
Let C be a configuration compatible with T . We have to show that C is a-aperiodic for every a < 1/4. It is 
enough to prove that every configuration compatible with the Thue-Morse substitution rule is a-aperiodic. 

Informally, we can reduce the statement to the one-dimensional case, since Thue-Morse substitution is 
an xor-combination of two one-dimensional substitutions. Here are the details. 

Consider a one-dimensional substitution system with two rules, — ?■ 01 and 1 — )• 10. Applying these 
rules to and 1 , we get 

0^01 ^0110^01101001 ^ 
1 ^ 10 ^ 1001 100101 10 ^ . . . 

Let a„ and b„ be the nth terms in these sequences (ao = 0, ai = 01, . . . , Z>o = 1, = 10, etc.); it is easy to 
see that an+i = a^bn and bn+\ = bnan. 

For some n we consider the xor-combination of these strings, where the (/, 7)-th bit is xor of the /th 
bit in the first string and the jth bit in the second string. Since bn is a bitwise negation of a„, we get only 
two different combinations (one obtained from two copies of a„ or two copies of b„, and the other obtained 
from different strings), which are bitwise opposite. It is easy to see (e.g., by induction) that these two square 
patterns are images of and 1 after n steps of two-dimensional Thue-Morse substitution. 

To prove the statement for aperiodicity of the Thue-Morse configuration, we start with an estimate for 
(one-dimensional) aperiodicity of a„ and b,^ : 

Lemma 1 (folklore). For any integer u > and for any n such that u < \an\l^ the shift by u steps to the 
right changes at least \an\/A positions in an and leaves unchanged at least \an\/^ positions. {Formally, in 
the range 1, . . . , (2" — u) there exist at least (1/4) • 2" positions i such that the ith and the {i + u)-th bits in 
an coincide and at least (1 /4)2" positions where these bits differ.) 

Proof. String a„ can be represented as abbabaab, where a = a„_3 and b = ^,,-3. One may assume without 
loss of generality that u > \a\ (or otherwise we apply Lemma[I]separately to the two halves of a„). Note that 
ba appears in the sequence twice: once preceded by a copy of a and once preceded by a copy of b. Since 
these copies have opposite bits, the shifted bits match in one of the cases and do not match in the other one. 
The same is true for ab, which appears preceded both by a and b. □ 

Now consider a large N x N square in a two-dimensional Thue-Morse configuration and some shift 
vector T. We assume that A'^ is much bigger than components of T (since we are interested in the limit 
behavior as N ^ °°). Moreover, we may assume that some power of 2 (let us call it m) is small compared to 
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A'^ and large compared to T . Then, the N xN square consists of a large number of mxm Thue-Morse blocks 
and some boundary part (which can be ignored by changing a slightly). Then, we can consider each mxm 
block separately to estimate the fraction of positions that are changed by the T-shift. If T is horizontal or 
vertical, we can use the statement of the lemma directly: At least 1/4 of all positions are changed. If not (i.e., 
if the shift has two nonzero components), we are interested in the probability of some event that is an xor 
combination of two independent events with probabilities in the interval (1/4,3/4). It is easy to check that 
such an event also has probability in (1/4,3/4) (in fact, even in (3/8,5/8), but we do not need this stronger 
bound). 

Theorem |4]is proved. □ 

In fact, the bound 1/4 can be replaced by 1/3 if we use a more professional analysis of the Thue-Morse 
sequence (see, e.g., |[33l ). However, if we want to get the strongest result of this form and make the bound 
close to 1 , this substitution rule does not work. We can use some other rule (in a bigger alphabet) as Pritykin 
and Ulyashkina have shown |28 1, but we prefer to give another construction with variable zoom factors (see 
Section|6]). 



5 Variable zoom factor 

The fixed-point construction of an aperiodic tile set is flexible enough and can be used in other contexts. For 
example, the "zoom factor" N could depend on the level. This means that instead of one tile set T we have 
a sequence of tile sets Tq, Ti , T2, . . ., and instead of one zoom factor A'^ we have a sequence of zoom factors 
No,Ni,.... The tile set Tq simulates Ti with zoom factor A'o, the tile set Ti simulates T2 with zoom factor Ni, 
etc. 

In other words, To-tilings can be uniquely split (by horizontal and vertical lines) into No x Nq macro-tiles 
from some list, and the macro-tiles in this list are in one-to-one correspondence (which respects matching 
rules) with Zi. So To-tihngs are obtained from Ti -tilings by replacing each Ti-tile by the corresponding 
To-macro-tile, and each To-tiling has a unique reconstruction. 

Further, every Ti -tiling can be split into macro-tiles of size A'^i x Ni that correspond to T2-tiles. So after 
two steps of zooming out, every To-tiling looks like a T2-tiling; only a closer look reveals that each T2-tile is 
in fact a Ti -macro-tile of size Ni xNi, and an even closer look is needed to realize that every Ti-tile in these 
macro-tiles is in fact a To-macro-tile of size Nq x Nq. 

For such a To-tiling we can say that it consists of level 1 macro-tiles of size Nq x Nq (isomorphic to Ti); 
at the same time it consists of level 2 macro-tiles of size NqNi x NqNi (isomorphic to T2), etc. 

This is what we want to achieve (together with other things needed to get the tile set with desired 
properties). How do we achieve this? Each macro-tile should "know" its level: A macro-tile that simulates 
a Tjfc-tile and is made of T^^-i -tiles, should have k in some place on the tape of the Turing machine simulated 
in this macro-tile. To make this information consistent between neighbors, k is exhibited as a part of the 
macro-colors at all four sides. The value of k is used for the computations. Macro-colors on the sides of a 
macro-tile encode the coordinates of this macro-tile inside its father, and the computation should check that 
they are consistent modulo Nt (i.e., the x coordinate on the right side should be equal to the x coordinate on 
the left side plus 1 modulo A^^:, etc.). This means that A^^ should be computable from k; moreover, it should be 
computable fast enough to fit into the computation zone (which carries only &{Nk-i ) steps of computation). 
After Nk is computed, there should be enough time to perform the arithmetic operations modulo Nk, and so 
on. 
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Let us look at these restrictions more closely. We need to keep both k and the coordinates (modulo Nk) 
on the tape of level k macro-tiles, and \ogk + 0{\ogNk) bits are required for that. Both log/c and logNk 
should be much less than Nt-i, so all the computations could fit in the available time frame. This means that 
Nk should not increase too fast or too slowly. Say, Nk = logfc is too slow (in this case k occupies almost all 
available space in macro-tiles of level k—l, and we do not have enough time even for simple computations), 
and Nk = 2^* ' is too fast (in this case logNk is too large compared to time and space available on the 
computation zone of a macro-tile of level k). Also we need to compute Nk when k is known, so we assume 
that not only the size of Nk (i.e., logNk) but also the time needed to compute it (given k) is small compared to 
Nk-i- These restrictions still allow many possibilities: Say, Nk could be proportional to \/^, k, 2^, 2^^ \or k\ 
Note that we say "proportional" since Nk needs to be reasonably large even for small k (we need some space 
in the macro-tile for wires and all our estimates for computation time are not precise but only asymptotic, 
so we need some reserve for small k). 

There is one more problem: It is not enough to ensure that the value of k is the same for neighbor macro- 
tiles. We also need to ensure that this value is correct, i.e., is 1 for level 1 macro-tiles made of To-tiles, is 
2 for level 2 macro-tiles made of Ti -tiles, etc. To guarantee this, we need to compare somehow the level 
information that is present in a macro-tile and its sons. Using the anthropomorphic terminology, we say that 
each macro-tile "knows" its level, since it is explicitly written on its tape, and this is, so to say, "conscious" 
information processed by a computation in the computation region of the macro-tile. One may say also that 
a macro-tile of any level contains "subconscious" information ("existing in the mind but not immediately 
available to consciousness" |[34ll). This is the information that is conscious for its sons, grandsons, and so 
on (all the way down to the ground level). The problem is that the macro-tile cannot check consistency 
between conscious and subconscious information since the latter is unavailable (the problem studied by 
psychoanalysis in a different context). 

The solution is to check consistency in the son, not in the father. Every tile knows its level and also 
knows its position in its father. So it knows whether it is in the place where its father should keep level bits, 
and it can check whether indeed the level bit that its father keeps in this place is consistent with the level 
information the tile has. (In fact we used the same trick when we simulated a substitution rule: A check 
that the father letter of a tile coincides with the letter of the father tile is done in the same way.) The careful 
reader will also note here that now the neighbor tiles will automatically have the same level information, so 
there is no need to check consistency between neighbors. 

This kind of "self-similar" structure with variable zoom factors can be useful in some cases. Though 
it is not self-similar according to our definition, one can still easily prove that any tiling is aperiodic. Note 
that now the computation time for the Turing machine simulated in the central part increases with level, and 
this can be used for a simple proof of undecidability of the domino problem. The problem in the standard 
proof (based on the self-similar construction with fixed zoom factor) is that we need to place computations 
of unbounded size into this self-similar structure, and for that we need special geometric tricks (see LL.3J). 
With our new construction, if we want to reduce an instance of the halting problem (some machine M) to 
the domino problem, we add to the program embedded in our construction the parallel computation of M on 
the empty tape; if it terminates, this destroys the tiling. 

In a similar way we can show that the existence of a periodic tiling is an undecidable property of a 
tile set, and, moreover, the tile sets that admit periodic tilings and tile sets that have no tilings form two 
inseparable sets (another classical result; see |[T6l ). Recall that two sets A and B are called (computably) 
inseparable if there is no computable set C such that A C C and B n C = 0. 

Here is an example of a more exotic version of the latter result (which probably is of no interest in itself 
but just serves as an illustration of the technique). We say that a tile set T is m-periodic if T-tilings exist and 
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for each of them the set of periods is the set of all multiples of m, in other words, if the group of periods is 
generated by (0,m) and (ni,0). Let E [respectively O] be all m-periodic tile sets for all even m [respectively 
odd m]. 

Theorem 5. The sets E and O are inseparable enumerable sets. 

Proof. It is easy to see that the property "to be an m-periodic tile set" is enumerable (both the existence of 
an m-periodic tiling and enforcing periods (m,0) and (0,m) are enumerable properties). 

It remains to reduce some standard pair of inseparable sets (say, machines that terminate with output 
and 1) to (£■, O). It is easy to achieve this by using the technique explained above. Assume that the numbers 
Nk increase, being odd integers as long as the computation of a given machine does not terminate. When 
and if it terminates with output [respectively 1], we require periodicity with odd [respectively even] period 
at the next level. □ 

Another application of a variable zoom factor is the proof of the following result obtained by Lafitte and 
Weiss (see [21J) using a Turing machine simulation inside a Berger-Robinson consti^uction. 

Theorem 6. Let f be a total computable function whose arguments and values are tile sets. Then, there 
exists a tile set T that simulates a tile set /(t). 

Here we assume that some computable encoding for tile sets is fixed. Since there are no restrictions on 
the computation complexity of /, the choice of the encoding is not important. 

Proof. Note that for identity function / this result provides the self-simulating tile set of Section 12.31 To 
prove it in the general case, we may use the same kind of fixed-point technique. However, there is a problem: 
The computation resources inside a tile are limited (by its size) while time needed to compute / can be large 
(and, moreover, depends on the tile size). 

The solution is to postpone the simulation to large levels. If a tile set Tq simulates Ti , which simulates 
T2, which simulates, etc., up to t„, then Tq simulates t„, too. Therefore we may proceed as follows. 

We use the construction explained above with a variable zoom factor. Additionally, at each level the 
computation starts with a preliminary step that may occupy up to (say) half of the available time. On this 
step we read the program that is on the tape and convert it into the tile set. (Recall that each program 
determines some tile set Tq such that To-tilings can be uniquely split into macro-tiles, and this program is 
written on a read-only part of the tape simulated in the computation zone of all macro-tiles, as was explained 
in Section [2!2l ) Then, we apply / to the obtained tile set. 

This part of the computation checks also that it does not use more than half of the available time and 
that the output is small enough compared to the macro-tile size. If this time turns out to be insufficient or 
the output is too big, this part is dropped and we start a normal computation for the variable zoom factor, as 
explained above. In this case, the zoom factor on the next level should be greater than the zoom factor on the 
current level (e.g., we may assume Nk = Ck for some large enough constant C). However, if the time is large 
enough and the result (the list of tiles that corresponds to /'s output) is small compared to the macro-tile 
size, we check that the macro-tile (of the current level) belongs to the tile set computed. The hierarchy of 
macro-tiles stops at this level. The behavior of macro-tiles at this level depends on /: They are isomorphic 
to /(To)-tiles. Since the program is the same at all levels and the computation of / should be finite (though 
may be very long), at some (big enough) level the second possibility is activated, and we get a macro-tile 
set isomorphic to /(t), where T is the tile set on the ground level. □ 
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Another application of the variable zoom factor technique is the construction of tile sets with any given 
computable density. Assume that a tile set is given and, moreover, that all tiles are divided into two classes, 
say, A-tiles and B-tiles. We are interested in a fraction of A-tiles in a tiling of an entire plane or its large 
region. If the tile set is flexible enough, this fraction can vary. However, for some tile sets this ratio tends 
to a limit value when the size of a tiled region increases. This phenomenon is captured in the following 
definition: We say that tile set T divided into A- and B-tiles has a limit density a if for every £ > there 
exists N such that for any n> N the fraction of A-tiles in any tiling of the nxn square is between a — e and 
a + e. 

Theorem 7. (i) If a tile set has a density a, then a is a computable real number in [0, 1]. (ii) Any computable 
real number <X E [0, 1] is a density of some tile set. 

Proof. The first part of the proof is a direct corollary of the definitions. For each n we can consider all tilings 
of the nxn square and look for the minimal and maximal fractions of A-tiles in them. Let us denote the 
minimal and maximal fractions by m„ and M„ respectively. These rational numbers are computable given n. 
It is easy to see that the limit frequency (if it exists) is in the interval [m„,M„]. Indeed, in a large square split 
into squares of size nxn the fraction of A-tiles is between ni„ and M„, being at the same time arbitrarily 
close to a. Therefore, a is computable (to get its value with precision e, we increase n until the difference 
between M„ and m„ becomes smaller than e). 

It remains to prove (ii). Since a is computable, there exist two computable sequences of rational num- 
bers and r,- that converge to a in such a way that 

Our goal will be achieved if macro-tiles of the first level have density of either l\ or r\ , macro-macro-tiles 
of the second level have density of either h or r2, and so on. Indeed, each large square can be split into 
macro-tiles (and the border that does not change the density much), so in any large square the fraction of 
A-tiles is (almost) in The same argument works for macro-macro-tiles, etc. 

However, this plan cannot be implemented directly. The main difficulty is that the computation of /, and 
r; may require a lot of time whereas the computation abilities of macro-tiles of level / are limited. (We use 
variable zoom factors, e.g., we may let Nk = Ck, but they cannot grow too fast.) 

The solution is to postpone the switch from densities and r, to densities and r,_|_i to the higher level 
of the hierarchy where the computation has enough time to compute all these four rational numbers and find 
out in which proportion Z,- and r,-tiles should be mixed in Z;+i- and r;+i -tiles. (We need the denominators 
in both fractions and r,+i to be equal to the number of /-level macro-tiles in the (/ + 1) -level macro-tile, 
but this restriction can always be satisfied by a slight change in the sequences l^ and r^, which leaves a 
unchanged.) So, we allocate, say, the first half of the available time for a controlled computation of all these 
values; if the computation does not finish in time, the densities for the next level are the same as for the 
current level. (We require that all macro-tiles in the same father tile have the same density, either /, or r,). If 
the computation terminates in time, we use the result of the computation to have two types of the next level 
tiles: one with density and one with density r,+i. They are made by using prescribed amounts of 
and r,-tiles. (Since each tile knows its coordinates, it can find out whether it should be of the first or second 
type.) This finishes the construction. □ 

6 Strongly aperiodic tile sets revisited 

In Section |4] we constructed a tile set such that every tiling is a-aperiodic for every a < 1/4. Now we want 
to improve this result and construct a tile set such that every tiling is, say, 0.99-aperiodic (here 0.99 can be 
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replaced by any constant less than 1). It is easy to see that this cannot be achieved by the same argument, 
with Thue-Morse substitutions, nor with any substitutions in a two-letter alphabet; we need a large alphabet 
to make the constant close to 1 . 

It is possible to achieve 0.99-aperiodicity with a carefully chosen substitution rule (in a bigger alphabet), 
as recently proposed by Pritykin and Ulyashkina 1281 . by just applying Theorem |3] (similarly to the argument 
for the Thue-Morse substitution presented in Section HJl. In this section we present an alternative proof of 
this result. We exploit substitution rules with variable zoom factors (and different substitutions on each 
level) and use the idea of an error-correcting code. 

Instead of one single alphabet. A, we now consider an infinite sequence of finite alphabets, Ao,Ai ,A2, . . .; 
the cardinality of will grow as k grows. Then, we consider a sequence of mappings: 

5l.Ai— >Aq , 52.A2— ^Aj , 53.A3— ^Aj 

where No,Ni,N2,- ■ ■ are some positive integers (zoom factors); Nt will increase as k increases. 

Then, we can compose these mappings. For example, a letter z in A2 can be first replaced by an Ni x A^i 
square ^2(2) filled by A 1 -letters. Then, each of these letters can be replaced by an A^o x Nq square filled by 
Ao-letters according to ^'i, and we get an NqNi x NqNi square filled by Ao-letters; we denote this square by 
^li^iiz)) (slightly abusing the notation). 

We call all this (i.e., the sequence of A^, Nt, s^) a substitution family. Such a family defines a class of 
Ao-configurations compatible with it (in the same way as in Section[3]). Our plan is to construct a substitution 
family such that 

• every configuration compatible with this family is 0.99-aperiodic, and 

• there exists a tile set and projection of it onto Aq such that only compatible configurations (and all 
compatible configurations) are projections of tilings. 

In other words, we use the same argument as before (proving Theorem H]) but use a substitution family 
instead of one substitution rule. This substitution family will have two special properties: 

A. Symbols used in different locations are different. This means that Ayt-letters that appear in a given 

position of the squares Sk+i{z) for some z G A^^i never appear in any other places of these squares 
(for any z); thus, set A^ is split into Nk x Nk disjoint subsets used for different positions in Nk x Nt 
squares. 

B. Different letters are mapped to squares that are far away in terms of Hamming distance. This means that 

if z,w G A<;+i are different, then the Hamming distance between images and i'i+i(w) is large: 

The fraction of positions in the Nk x Nt square, where 5';+i (z) and Si+i (w) have equal letters does not 
exceed St- 

Here £, will be a sequence of positive reals such that X^,>o £, < 0.01. 

This implies that composite images of different letters are also far apart. For example, the fraction of 
positions in the A'^oA'^i x A'^oA'^i square where s\ (^2(2)) and ^'i {s2{w)) coincide does not exceed £0 + £1 < 0.01. 
Indeed, in S2{z) and S2{w) we have at most £1 -fraction of matching letters; these letters generate £i-fraction 
of matching Ao-letters on the ground level; all other (nonmatching) pairs add £o-fraction. In fact, we get 
even a stronger bound 1 — (1 — £o)(l — £1). 
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For the same reasons, if we take two different letters in and then drop to the ground level and obtain 
two squares of size NqNi • --Nk-i x A'^oA'^i • • • A'it-i filled by Ao-letters, the fraction of coincidences is at most 
eo + --- + ek-i <0.01. 

This property of the substitution family implies the desired property: 

Lemma 2. If an Ao-configuration U is compatible with a substitution family having properties (A) and (B), 
then U is 0.99-aperiodic. 

Proof. Consider a shift vector T. If T is not a multiple of Nq (one of the coordinates is not a multiple of Nq), 
then property (A) guarantees that the original configuration and its T shift differ everywhere. Now assume 
that r is a multiple of A'q. Then, T induces a (r/A'^[))-shift of an A i -configuration Ui that is an ^i-preimage 
ofU. If r is not a multiple of NqNi , then T /Nq is not a multiple of A'^i and for the same reason this (r /Nq)- 
shift changes all the letters inUi. Different letters in Ai are mapped to A'q x A^o squares that coincide in at 
most £o-fraction of positions. 

If r is a multiple of NqNi but not NoN\N2, we get a.T / {NqNi) shift of A2-configuration U2 that changes 
all its letters, and different letters give squares that are 1 — (Cq + £1 ) apart. The same argument works for the 
higher levels. □ 

It remains to construct a substitution family that has properties (A) and (B) and can be enforced by a 
tile set. Property (B) (large Hamming distance) is standard for coding theory, and the classical tool is the 
Reed-Solomon code. 

Let us recall the idea of the Reed-Solomon code (for details see, e.g., BJ). The codewords of the 
Reed-Solomon code are tables of (values of) polynomials of bounded degree. More precisely, we fix some 

finite field of size q and an integer d > 0. Let p{x) = ao + a\x-\ +ad-\x''^^ be a polynomial over 

¥q of degree less than d. Then the codeword corresponding to p{x) (i.e., the encoding of the sequence 
ao,... ,ad-\) is a vector in (F^)^ (i.e., a sequence of q elements of the field), which consists of the values of 
this polynomial computed at all points x G F^. Thus, for given parameters d and q, the code consists of q'^ 
codewords. Since two polynomials of degree less than d can coincide in at most {d — \ ) points, the distance 
between any two codewords is at least q — d + \. Of course, this construction can be used even if the desired 
length of the codewords is not a size of any finite field; we can choose a slightly larger field and use only 
part of its elements. 

Now we embed these codes in a family of substitution rules. First, let be a finite field (the size of 
which is specified below) and let A^ be equal to B/t x {0, 1, . . . jA'yt — 1} x {0, 1, . . . jA't — 1}; let us agree that 
we use letters {b,i,j) only in the (/, 7)-position of an Sk+i-ivaa.ge. This trivially implies requirement (A). 

Then, we construct a code that encodes each A,t+i -letter w by a string of length made of Bjt-letters 
(arranged in a square); adding the coordinates, we get the 5^+1 -image of w. Thus, we need a sequence of 
codes: 

51 : Ai — ^ B^°^^°, such that (w) and 5'i (w') coincide at most in £0 fraction 

of all positions (if w / w'), 

52 : A2 — )• B^' , such that S2{w) and S2{w') coincide at most in £1 fraction 

of all positions (if w / w'), 

To satisfy requirement (B), we need a code with the Hamming distance (between every two codewords) at 
least (1 — Sk)N^. The Reed-Solomon code works well here. The size of the field can be equal to the length 
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of the codeword, i.e., Let us decide that Nk is a power of 2 and the size of the field is exactly N'^. 
(There are fields of size 2' for every t = 1,2,3,...; we could also use Zt/pZ, for prime p of an appropriate 
size.) To achieve the required code distance, we use polynomials of degree less than EuN]^- The number of 
codewords (polynomials of degree less than S^A^f ) is at least 2^*^* (even if we use only polynomials with 
coefficients and 1). This is enough if 

|A,+i|<2^*^*. 

Recalling that \Ak+\ \ = \Bk+\ \ -Nk+x ^^'^ J^k+\ is a field of size A'f^^j, we get the inequality 

A^^i < 2^*^* , or 41ogA^,+i < EuNI 

Now let Nk = 2*+^^ for some constant c; we see that for large enough c this inequality is satisfied for Ek with 
sum less than 0.01 (or any other constant), since the left-hand side is linear in k while the right-hand side is 
exponential. 

Now it remains to implement all this scheme using tiling rules. As we have discussed, the zoom factor 
Nk = 2^^+' is acceptable for the construction. This factor leaves enough space to keep on the tape two 
substitution letters (for the tile itself and its father tile), since these letters require linear size (in k). Moreover, 
we have enough time to perform the computations in the finite fields needed to construct the error-correction 
code mappings. Indeed, in a k-\&ve\ macro-tile we are allowed to use exponential (in the bit size of the field 
element) time. Recall that one can operate with elements in the field of size T' using polynomial (in r) time; 
to this end, we need to construct some irreducible polynomial p of degree r over the field of two elements 
and then perform arithmetic operations (on polynomials) modulo p. All these operations can be done by 
deterministic algorithms in polynomial time (see, e.g., |[23l ). Thus, we can reuse here the construction of 
the proof of Theorem [3l 

The construction above works with every constant a < 1 instead of 0.99. So, we get a stronger version 
of Theorem m 

Theorem 8. For every a < \ there exists a tile set z such that z-tilings exist and every z-tiling is a-aperiodic. 

Remark. We can also get an a-aperiodic tile set (for every a < 1) as a corollary of the result of the 
next section; indeed, we construct there a tile set such that any tiling embeds a horizontal sequence with 
high-complexity substrings, and such a sequence cannot match itself well after a shift (in fact, to get a- 
aperiodicity we would need to replace a binary alphabet by a larger finite alphabet in this argument). We 
can superimpose this with a similar 90° -rotated construction; then, any nonzero translation will shift either 
a vertical or a horizontal sequence and therefore change most of the positions. Note that in this way we 
can also get a tile set that is a-far from every periodic pattern (a slightly different approach to defining 
"strong aperiodicity"). However, the arguments used in Section [T] are more complicated than the proof of 
this section. So we preferred to present here a simpler and more direct proof of Theorem [H 

7 Tile sets with only complex tilings 

In this section we provide a new proof of the following result from Q : 

Theorem 9. There exist a tile set T and constants ci > and C2 such that z-tilings exist and in every z-tiling 
T every N x N square has Kolmogorov complexity at least ciN — C2. 
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Here Kolmogorov complexity of a tiled square is the length of the shortest program that describes this 
square. We assume that programs are bit strings. Formally speaking, Kolmogorov complexity of an object 
depends on the choice of programming language. (Consult [31 1 for the definition and properties of Kol- 
mogorov complexity.) However, in our case the choice of programming language does not matter, and you 
may think of Kolmogorov complexity of an object as the length of the shortest program in your favorite 
programming language that prints out this object. We need to keep in mind only two important properties 
of Kolmogorov complexity. First, the Kolmogorov complexity function is not computable, but it is upper 
semicomputable. This means that there is an algorithm that for a given n enumerates all objects that have 
complexity less than n. The enumeration can be done by a brute force search over all short descriptions. We 
cannot say in advance which programs stop with some output and which do not, but we can run all programs 
of length less than n in parallel, and enumerate the list of their outputs, as some programs terminate. Second, 
any computable transformation (e.g., the change of encoding) changes Kolmogorov complexity at most by 
0(1). We refer to |7| for a discussion of Theorem |9] (why it is optimal, why the exact value of c\ does not 
matter, etc.) and other related results. 

7.1 A biinfinite bit sequence 

Proof. We start the proof in the same way as in fT| : We assume that each tile keeps a bit that propagates 
(unchanged) in the vertical direction. Then, any tiling contains a biinfinite sequence of bits ft), (where / € Z). 
Any N xN square contains an N-bit substring of this string, so if (for large enough N) every A'^-bit substring 
of CO has complexity at least ciN for some fixed ci, we are done. 

We say that a sequence ft) has Levin's property if every A^-bit substring x of ftj has complexity Q.{N). 
Such a biinfinite sequence indeed exists (see [7J; another proof can be obtained by using the Lovasz local 
lemma; see OOl ). So our goal is to formulate tiling rules in such a way that a correct tiling "ensures" that 
the biinfinite sequence embedded in it indeed has this property. 

The set of all "forbidden" binary strings, i.e., strings x such that K{x) < c\ \x\ — C2 (where K{x) denotes 
the Kolmogorov complexity of x, and \x\ denotes the length of x) is enumerable: There is an algorithm that 
generates the list of all forbidden substrings. It would be nice to embed into the tiling a computation that 
runs this algorithm and compares its output strings with the substrings of co; such a computation blows up 
(creates a tiling error) if a forbidden substring is found. 

However, there are several difficulties. 

• Our self-similar tiling contains only finite computations. The higher is rank k, the bigger are the 
k-level macro-tiles, and the longer computations they can contain. But at any level the computation 
remain finite. This is a problem since for a given string x we do not know a priori how much time the 
shortest program for x uses, so we never can be sure that the Kolmogorov complexity of x is large. 
Hence, each substring of (O should be examined in computations somehow distributed over infinitely 
many macro-tiles. 

• The computation at some level deals with bits encoded in the cells of that level, i.e., written on the 
computation tape. So the computation cannot access the bits of the sequence (that are "deep in the 
subconscious") directly and some mechanism to dig them out is needed. 

Let us explain how to overcome these difficulties. 
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7.2 Delegation of bits 



A macro-tile of level kis a square whose side is Lj^= Nq-Ni ■■ ■ N^- 1 , so there are bits of the sequence that 
intersect this macro-tile. Let us delegate each of these bits to one of the macro-tiles of level k it intersects. 
(We do it for every k.) Note that the macro-tile of the next level is made of Nk x Nk macro-tiles of level k. 
We assume that Nk is much bigger than (see the end of this subsection for more details on the choice of 
Nk)', this guarantees that there are enough macro-tiles of level k (in the next level macro-tile) to serve all bits 
that intersect them. Let us decide that the ith (from bottom to top) macro-tile of level k in a. {k+ 1) -level 
macro-tile serves (consciously knows, so to say) the ith bit (from the left) in its zone (see Fig. |5]l. Since 
Nk ^ Lk, we have many more macro-tiles of level k (inside some macro-tile of level k+ I) than needed to 
serve all bits. So some fc-level macro-tiles remain unused. 
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Figure 5: Bit delegation. 

Thus, each bit (each vertical line) has a representative on every level — a macro-tile that consciously 
knows this bit. However, we need some mechanisms that guarantee that this information is indeed true 
(i.e., consistent on different levels). On the bottom level this is easy to achieve, since the bits are available 
directly. 

To guarantee the consistency we use the same trick as in Section [3j At each level a macro-tile keeps not 
only its own bit but also its father's bit, and makes necessary consistency checks. Namely, each macro-tile 
knows (has on its computation tape): 

• the bit delegated to this macro-tile; 

• the coordinates of this macro-tile in its father macro-tile (which are already used in the fixed-point 
construction); note that the j-coordinate is at the same time the position of the bit delegated to this 
macro-tile (relative to the left boundary of the macro-tile); 

• the bit delegated to the father of this macro-tile; and 

• the coordinates of the father macro-tile in the grandfather macro-tile. 

This information is subject to consistency checks: 
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• The information about the father macro-tile should coincide with the same information in the neighbor 
tiles (unless they have a different father, i.e., one of the coordinates is zero). 

• If the bit delegated to the father macro-tile is from the same vertical column as the bit delegated for 
this macro-tile, these two bits should match. 

• If the macro-tile occupies a place in its father macro-tile where the bit delegated to the father or some 
bits of the father's coordinates (inside the grandfather macro-tile) are kept, then this partial information 
on the father level should be should be consistent with the information about father coordinates and 
bit. 

These tests guarantee that the information about the father is the same in all brothers, and some of these 
brothers (which are located on the father tape) can check it against actual father information; at the same 
time some other brother (that has the same delegated bit as the father) checks the consistency of the delegated 
bits information. 

Note that this scheme requires that not only logA^jt but also logA/^t+i is much less than A'yt-i. This 
requirement, together with the inequality = NoNi ■ ■ -Nk-i < Nk (discussed earlier), is satisfied if Nk = Q^ , 
where 2 is a large enough constant (which is needed also to make macro-tiles of the first level large enough) 
and c> 2 (so 1 + c + H h c*^"^ < c*). 

Later, in Section[T3l the choice of c has to be reconsidered: We need 2 < c < 3 to achieve error correction, 
but for our current purposes this does not matter. 

7.3 Checking bit blocks 

We explained how macro-tile of any level can have true information about one bit (delegated to it). However, 
we need to check not bits but substrings (and artificially introduce a tiling error if a forbidden string appears). 
Note that it is acceptable to test only very short substrings compared to the macro-tile size {Nk)- If this test 
is done on all levels, this restriction does not prevent us from detecting any violation. (Recall that short 
forbidden substrings can appear very late in the generation process, so we need computation at arbitrary 
high levels for this reason, too.) 

So we need to provide more information to macro-tiles. This can be done in the following way. Let us 
require that a macro-tile contains not one bit but a group of bits to check: a group of bits that starts at the 
delegated bit and has length depending on the level k (and growing very slowly with k; e.g., log log log is 
slow enough). If this group is not completely inside a macro-tile (i.e., it extends out of the region occupied 
by the macro-tile), we ignore the outstanding part. 

Similarly, a macro-tile should have this information for the father macro-tile (even if the bits are outside 
its own region). This information about the father macro-tile should be the same for brothers (which is 
checked by matching macro-colors of neighboring brothers). Also each macro-tile checks (on its compu- 
tational zone) that the value of its own delegated bit is coherent with its father's string of bits to check: A 
macro-tile knows its coordinate in the father macro-tile and the coordinates of the father tile in the grandfa- 
ther, so it knows whether its delegated bit makes a part of the father's bits to check. 

The computation in the computation zone generates the list of all forbidden strings (strings that have too 
small Kolmogorov complexity) and checks the generated forbidden strings against all the substrings of the 
group of bits available to this macro-tile. This process is bounded in time and space, but this does not matter 
since every string is considered on a high enough level. 

Our construction has a kind of duplication: We first guarantee the consistency of information for indi- 
vidual bits, and then, we do the same for substrings. The first part of the construction is still needed, since 
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Figure 6: Degenerate case: An infinite vertical line is a boundary between macro-tiles of all levels. 



we need arbitrarily long substrings to be checked by macro-tiles (of high enough level); thus delegation 
of substrings cannot start from the ground level where the tile size is limited, so we need to deal with bits 
separately. 

7.4 Last correction 

The argument just explained still needs some correction. We claim that every forbidden string will be 
detected at some level where it is short enough compared to the level parameters. However, some strings 
may never become a part of one macro-tile. Imagine that there is some vertical line that is a boundary 
between macro-tiles of all levels (so we have bigger and bigger tiles on both sides, and this line is still the 
boundary between them; see Fig. [6l). Then, a substring that crosses this line will never be checked and 
therefore we cannot guarantee that it is not forbidden. 

There are several ways to get around this problem. One can decide that each macro-tile contains in- 
formation not only about blocks inside its father macro-tile but in a wider region (say, three times wider, 
including "uncle" macro-tiles); this information should be checked for consistency between "cousins", too. 
This trick (extending zones of responsibility for macro-tiles) will be used later in Section [8] 

But to prove Theorem|9]a simpler solution is enough. Note that even if a string on the boundary is never 
checked, its parts (on both sides of the boundary) are, so their complexity is proportional to their length. 
One of the parts has length at least half of the original length, so we still have a complexity bound, though 
the constant will be twice smaller. 

This finishes the proof of Theorem |9l □ 

8 Subshifts 

The analysis of the proof in the previous section shows that it can be divided into two parts. We defined 
forbidden strings as bit strings that are sufficiently long and have complexity at most a • (length). We started 
by showing that biinfinite strings without forbidden factors (substrings) exist. Then, we constructed a tile 
set that embeds such a biinfinite string in every tiling. 
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The second part can be separated from the first one, and in tliis way we get new proofs for some results 
of Simpson (3T\ and Hochman | TSl about effectively closed subshifts. 

Fix some alphabet A. Let F be a set of A-strings. Consider a set Sf of all biinfinite A-sequences that have 
no factors (substrings) in F. This set is a closed one-dimensional subshift over A, i.e., a closed shift-invariant 
subset of the space of all biinfinite A-sequences. If the set F is (computably) enumerable, Sp is called an 
effectively closed one-dimensional subshift over A. Iff is finite, Sp is called a subshift of finite type. 

We can define two-dimensional subshifts in a similar way. More precisely, let F be a set of two- 
dimensional patterns (squares filled with A-letters). Then, we can consider a set Sf of all A-configurations 
(= mappings — )• A) that do not contain any pattern from F. This is a closed shift-invariant set of A- 
configurations (= two-dimensional closed subshift over A). If F is (computably) enumerable, Sf is called a 
two-dimensional effectively closed subshift over A. If F is finite, Sf is called a two-dimensional subshift of 
finite type. 

As we have mentioned, subshifts of finite type (interpreted as local rules) are closely related to tilings. 
Each tile set determines a subshift where A is the set of tiles and forbidden patterns are pairs of neighbor 
nonmatching tiles. Going in the other direction, we should be more careful. A tile set in our definition 
cannot contain two different tiles with exactly the same colors. This leads to some problems. For example, 
the full shift over a two-letter alphabet (i.e., the set of all biinfinite sequences over a two-letter alphabet) 
cannot be represented by a set of two tiles. However, any subshift of finite type can be represented by some 
tile set. More precisely, for any subshift S of finite type over alphabet A there is a tile set T and some mapping 
£■ : T — )■ A such that E induces a bijection between the set of all T-tilings and the set of all configurations of 
the subshift S: we apply E pointwise to a T-tiling and get some A-configuration from the subshift; for each 
configuration in the subshift there exists exacdy one T-tihng in the f-preimage of this configuration. Such 
a tile set can be constructed as follows: Tiles are squares of large enough size filled by A-letters (a square 
with no patterns forbidden for this subshift); each tile represents a part of the configuration, and side colors 
are used to ensure that neighbor tiles overlap correctly. The mapping extracts (say) the central letter from a 
square. 

Thus, subshifts of finite type and tilings are essentially the same kind of objects. On the other hand, the 
effectively closed subshifts of dimension make a more general class of objects than subshifts of finite type. 
E.g., every nonempty one-dimensional subshift of finite type must contain a periodic configuration; for one- 
dimensional effectively closed subshifts this is not the case. However, the following theorem shows that two- 
dimensional subshifts of finite type are powerful enough to simulate any effectively closed one-dimensional 
subshift in the following sense (i.e., to simulate an effectively closed subshift, we need a subshift of finite 
type of dimension higher by 1): 

Theorem 10. Let A be some alphabet and let S be a one-dimensional effectively closed subshift over A. 
Then, there exist an alphabet B, a mapping r: B — t- A, and a two-dimensional subshift S' of finite type over B 
such that r-images of configurations in S' are (exactly) elements of S extended vertically {vertically aligned 
cells contain the same A-letter). 

(As we have mentioned, this result was independently obtained by Aubrun and Sablik using Robinson- 
style aperiodic tilings t2J.) 

Proof. The proof uses the same argument as in Theorem |9] Each cell now contains an A-letter that prop- 
agates vertically. Computation zones in macro-tiles generate (in available space and time) elements of the 
enumerable set of forbidden A-substrings and compare them with A-substrings that are made available to 
them. It remains to note that tiling requirements (matching colors) are local; that is, they define a finite type 
two-dimensional subshift. 
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Note that now the remark of Section 17.41 (the trick of extension of zones of responsibility for macro- 
tiles) becomes crucial, since otherwise the image of a configuration from S' may be a concatenation of 
two sequences (a left-infinite one and a right-infinite one); neither sequence contains forbidden patterns but 
forbidden patterns may appear at the point of concatenation. □ 

A similar argument shows that every two-dimensional effectively closed subshift can be represented as 
an image of a three-dimensional subshift of finite type (after a natural extension along the third dimension), 
any three-dimensional effectively closed subshift is an image of a four-dimensional subshift of finite type, 
etc. 

This result is an improvement of a similar one proved by Hochman (Theorem 1.4 in |fT8l|, where the 
dimension increases by 2), thus providing a solution of Problem 9.1 from ifTSl . Note also that it implies the 
result of Simpson |[32ll where one-dimensional sequences are embedded into two-dimensional tilings but in 
some weaker sense (defined in terms of Medvedev degrees). 

One can ask whether a dimension reduction is essential here. For example, is it true that every two- 
dimensional effectively closed subshift is an image of some two-dimensional subshift of finite type? The 
answer to this question (as well as related questions in higher dimensions) is negative. This follows from 
an upper bound in [7] saying that every tile set (unless it has no tilings at all) has a tiling such that all 
n X n squares in it have complexity 0{n) (a result that immediately translates for subshifts of finite type) 
and a result from |[30l that shows that some nonempty effectively closed two-dimensional subshift has nxn 
squares of complexity Therefore the latter cannot be an image of the first one (complexity can only 

decrease when we apply an alphabet mapping). 

9 Random errors 

9.1 Motivation and discussion 

In what follows we discuss tilings with faults. This means that there are some places (faults) where colors of 
neighbor tiles do not match. We are interested in "robust" tile sets: those that maintain some structure (for 
example, can be converted into an error-free tiling by changing a small fraction of tiles) if faults are sparse. 

There are two almost equivalent ways to define faulty tilings. We can speak about errors (places where 
two neighbor tiles do not match) or holes (places without tiles). Indeed, we can convert a tiling error into 
a hole (by deleting one of two nonmatching tiles) or convert a one-tile hole (one missing tile) into a small 
number of errors (at most 4) by placing an arbitrary tile there. Holes look more natural if we start with a set 
of holes and then try to tile the rest; however, if we imagine some process similar to crystallization when a 
tiling tries to become correct by some trial-and-error procedure, it is more natural to consider tiling errors. 
Since it makes little difference from the mathematical point of view, we use both metaphors. 

We use a hierarchical approach to hole patching that can be traced back to Gacs, who used it in a much 
more complicated situation [13|. This means that first we try to patch small holes that are not too close to 
each other (by changing small neighborhoods around them). This (if we are lucky enough) makes larger (and 
still unpatched) holes more isolated since there are fewer small holes around. Some of these larger holes 
(which are not too large and not too close to each other) can be patched again. Then, the same procedure 
can be repeated again for the next level. Of course, we need some conditions (that guarantee that holes are 
not too dense) to make this procedure successful. These conditions are described later in full detail, but the 
important question is the following: How do we ensure that these conditions are reasonable (i.e., general 
enough)? Our answer is as follows: We prove that if holes are generated at random (with each position 
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becoming a hole independently of other positions with small enough probability e), then the generated set 
satisfies these conditions with probability 1 . 

From the physics viewpoint, this argument sounds rather weak. If we imagine some crystallization 
process, errors in different positions are not independent at all. However, this approach could be a first 
approximation until a more adequate one is found. 

Note that patching holes in a tiling could be considered as a generalization of percolation theory. Indeed, 
let us consider a simple tile set made of two tiles: one with all black sides and the other with all white sides. 
Then, the tiling conditions reduce to the following simple condition: Each connected component of the 
complement to the set of holes is either completely black or completely white. We want to make small 
corrections in the tiling that patch the holes (and therefore make the entire plane black or white). This 
means that initially either we have small black "islands" in a white ocean or vice versa, which is exactly 
what percolation theory says (it guarantees that if holes are generated at random independently with small 
probability, the rest consists of one large connected component and many small islands.) 

This example shows also that simple conditions such as low density (in the Besicovitch sense) of the 
hole set are not enough. A regular grid of thin lines can have low density but still splits the plane into 
nonconnected squares; if half of these squares are black and the others are white, no small correction can 
patch the holes. 

One can define an appropriate notion of a sparse set in the framework of algorithmic randomness 
(Martin-Lof definition of randomness) by considering individual random sets (with respect to the Bernoulli 
distribution Bg) and their subsets as "sparse." Then, we can prove that any sparse set (in this sense) satisfies 
the conditions that are needed to make the iterative patching procedure work. This algorithmic notion of 
"sparseness" is discussed in [5]. However, in the current paper we do not assume that the reader is familiar 
with algorithmic randomness and restrict ourselves to classical probability theory. 

So our statements become quite lengthy and use probabilistic quantifiers "for almost all" (= with prob- 
ability 1). The order of quantifiers (existential, universal, and probabilistic) is important here. For example, 
the statement "a tile set T is robust" means that there exists some £ > such that for almost all E (with 
probability 1 with respect to the distribution where each point independently belongs to E with probability 
£) the following is true: For every (T,£')-tiling U there exists a T-tiling U' (of the entire plane) that is "close" 
to U. Here by {T,E)-ti\mg we mean a tiling of \£' (where existing pairs of neighbor tiles match). 

9.2 Islands of errors 

In this section we develop the notion of "sparsity" based on the iterative grouping of errors (or holes) and 
prove its properties. 

Let E C 1? be a. set of points; points in E are called dirty; other points are clean. Let j8 > a > be 
integers. A nonempty set X C £" is an {a,^)-island in E if 

(1) the diameter of X does not exceed a and 

(2) in the j3 -neighborhood of X there is no other point from E. 

(The diameter of a set is a maximal distance between its elements; the distance d is defined as Zoo, i.e., 
the maximum of distances along both coordinates; the j3 -neighborhood of X is a set of all points y such that 
d{y,x) < j8 for some x € X.) 

It is easy to see that two (different) islands are disjoint (and the distance between their points is greater 
than j3). 

Let (ai,j3i), (a2)i32), . . • be a sequence of pairs of integers and a, < j8,- for all /. Consider the following 
iterative "cleaning" procedure. At the first step we find all (o:i,j8i)-islands {rank 1 islands) and remove 
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Figure 7: Explanation tree; vertical lines connect different names for the same points. 

all their elements from E (thus getting a smaller set E\). Then, we find all (a2>i32) -islands in E\ {rank 2 
islands); removing them, we get E2 d E\, etc. The cleaning process is successful if every dirty point is 
removed at some stage. 

At the /th step we also keep track of the j3, -neighborhoods of islands deleted during this step. A point 
X G is affected during the /th step if x belongs to one of these neighborhoods. 

The set E is called sparse (for a given sequence a,-,j3,) if the cleaning process is successful, and, more- 
over, every point x € is affected at finitely many steps only (i.e., x is far from islands of sufficiently large 
ranks). 

The values of a, and j3, should be chosen in such a way that for sufficiently small £ > a Bg -random set 
is sparse with probability 1 . (As we have said, this justifies that our notion of sparsity is not unreasonably 
restrictive.) The sufficient conditions are provided by the following statement: 

Lemma 3. Assume that 

8 XI < < i8„ for every n and ^2 < °°- 

k<n i ^ 

Then, for all sufficiently small e > a Be-random set is sparse with probability 1. 

Proof. Let us estimate the probability of the event "x is not cleaned after n steps" for a given point x (the 
probability of this event does not depend on x). If x G E„, then x belongs to and is not cleaned during 
the «th step (when (a„, j8„)-islands in are removed; by definition we let Eq = E). Then, x G and, 
moreover, there exists some other point x\ G E„^\ such that d{x,x\) is greater than a„/2 but not greater 
than j3„ + o;„/2 (note that j8„ + a„/2 < 2j8„). Indeed, if there were no such xi in S,,-!, then the (a„/2)- 
neighborhood of x in E„^i would be an (a„,j8„) -island in £",,-1 and x would be removed. 

Further, we apply the same argument on level {n— 1). Each of the points xi and x (we use notation xq 
for X, to make the notation more uniform) belongs to E„^[; therefore it belongs to E„^2 together with some 
other point (at a distance greater than a„_i/2 but not exceeding 2j8„_i). Denote these two other points in 
En-2 by xoi (which exists because xq G £",1-1) and xn (which exists because xi G £",1-1) respectively. Thus, 
we have at least four points denoted by xqo = xq = x, xqi , xio = xi , and xi 1 in En^2- Then, we repeat the same 
argument for levels (« — 2), (?i — 3), etc. In this way we get a tree (Fig. |7]) that "explains" why x belongs to 

The distance between xq and xi in this tree is at least a„/2 whereas the diameter of the subtrees starting 
at Xq and xi does not exceed L,<„2j8,-. Therefore, the lemma's assumption guarantees that these subtrees 
cannot intersect. Since it is true on all levels, all the leaves of the tree are different. Note that all 2" leaves 
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of the tree belong to E = Eq. As every point appears in E independently of other points, each "explanation 
tree" is valid with probability £^ ". It remains to estimate the number of possible explanation trees for a given 
point X. 

To specify xi we need to specify the difference (vertical and horizontal distances) between xq and xi. 
Neither distance exceeds 2/3„; therefore we need about 21og(4j8„) bits to specify them (including the sign 
bits). Then, we need to specify the difference between xqq and xqi as well as the difference between xio and 
xii; this requires at most 41og(4j8„_i) bits. To specify the entire tree we therefore need 

21og(4/3„) + 41og(4j8„_i) + 81og(4j8„_2) + • • • + 2"log(4/3i) 
bits. Reversing the sum and taking out the factor 2", we can rewrite this expression as 

2"(log(4/3i) + log(4/32)/2 + ---). 

Since the series Y^log jS„/2" converges by assumption, the total number of explanation trees for a given point 
(and given n) does not exceed 2'^^^"\ so the probability for a given point x to be in E^ for a Bg -random E 
does not exceed e^"2'^^^"\ which tends to (even super-exponentially fast) as « — )• oo, assuming that e is 
small enough. 

We conclude that the event ";c is not cleaned" (for a given point x) has zero probability; the countable 
additivity guarantees that with probability 1 all points in I? are cleaned. 

It remains to show that every point with probability 1 is affected at finitely many steps only. Indeed, if x 
is affected at step n, then some point in its j8„ -neighborhood belongs to E„, and the probabihty of this event 
is at most 

0(jS,^)e2"2^(^") = 221°gA,+0(2")-log(l/e)2". 

the convergence conditions guarantees that logj3„ = o(2"), so the first term is negligible compared to the 
others, the probability series converges (for small enough e) and the Borel-Cantelli lemma gives the desired 
result. □ 

For our next step, we note that by definition a sparse set is split into a union of islands of different ranks. 
Now we prove that these islands together occupy only a small part of the plane. To formalize this statement, 
we use the notion of Besicovitch size (density) of a set £ C Z^. Let us recall the definition. Fix some point 
O of the plane and consider squares of increasing size centered at O. For each square consider the fraction 
of points in this square that belong to E. The limsup of these frequencies is called the Besicovitch density 
of E. (Note that the choice of the center point O does not matter, since for any two points Oi and O2 large 
squares of the same size centered at Oi and O2 share most of their points.) 

By definition the distance between two rank k islands is at least Pk- Therefore the (j8i:/2) -neighborhoods 
of these islands are disjoint. Each of the islands contains at most points (it can be placed in a rectangle 
that has sides at most at). Each neighborhood has at least jS^ points (since it contains a. x fii^ square 
centered at any point of the island). Therefore the union of all rank k islands has Besicovitch density at 
most {ak/^kY- Indeed, for a large square the islands near its border can be ignored, and all other islands are 
surrounded by disjoint neighborhoods where their density is bounded by {ak/^kY, see Fig. [8] 

One would like to conclude that the overall density of all islands (of all ranks) does not exceed Y.k{^k/hY- 
However, the Besicovitch density is in general not countably semiadditive (for example, the union of finite 
sets having density may have density 1), but in our case we are helped by the second requirement of the 
definition of a sparse set (each point is covered by only finitely many neighborhoods of islands). 
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Figure 8: Rank k islands form a set of low density. (In this picture each island is shown as a rectangle, which 
is not always the case.) 

Lemma 4. Let E be a sparse set for a given family of ttk and pk- Then, the Besicovitch density of E is 

Proof. Let be a center point used in the definition of Besicovitch density. By definition of sparsity, this 
point is not covered by j8yt-neighborhoods of rank k islands if k is greater than some K. Now we split the 
set E into two parts: one (£"<) formed by islands of rank at most K and the other (£>) formed by all islands 
of bigger ranks. As we have just seen, in a large square the share of £"< is bounded by Y.k<K{(^k/ up 
to negligible (as the size goes to infinity) boundary effects (where we consider each k < K separately and 
then sum over all k < K). A similar bound is valid for rank k islands with k > K, though the argument is 
different and a constant factor appears. Indeed, the j3yt-neighborhood of every island / does not contain the 
center point O. Therefore, any square 5 centered at O that intersects the island also contains a significant part 
of its (j8i:/2)-neighborhood A^: The intersection of and S contains at least {^k/'^Y elements, see Fig. |9] 
Therefore, the share of in S is bounded by 4^,t>/f (a^/ jS^t)^. □ 

Remark. It is easy to choose ak and satisfying the conditions of Lemma [3] and having arbitrarily 
small Y.{(Xk/PkY (by taking geometric sequences that grow fast enough). Therefore we get the following 
well-known result as a corollary of Lemmas |3] and |4j For every a > there exists £ > such that with 
probability 1 a Bg-random set has Besicovitch density less than a. (In fact, a much stronger result is well 
known: By the strong law of large numbers a Bg-random set has Besicovitch density e with probability 1.) 

In fact we will need a slightly more complicated version of Lemma |4] We are interested not only in 
the Besicovitch density of a sparse set E but also in the Besicovitch density of a larger set: the union of 
7/:-neighborhoods of rank k islands in E. Here are some parameters; in most applications we set = cak 
for some constant c. The same argument gives the bound 4^((ai + 27;^)/j3i)^. Assuming that 7^ > ak, we 
can rewrite this bound as OiX^iYk/ hY)- So we arrive at the following statement: 

Lemma 5. Let E be a sparse set for a given family of (Xk and pk and let Jk ^ OCk be some integers. 
Then, the union of y^-neighborhoods of level k islands {over all k and all islands) has Besicovitch density 

oinjk/m- 
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ft/2 



part of the j3i^/2-neighborhood 
of an island 



O 



part of the j3<^/2-neighborhood 
of the island that is guaranteed 
to be inside S 



Figure 9: Together with a point in a rank k island, every square S contains at least (j8i/2)^ points of its 
{f^k/'^) -neighborhood. 

9.3 Islands as a tool in percolation theory 

Let us show how some basic results of percolation theory can be proved using the island technique. 

Theorem 11. For some OCk and pk satisfying the requirements of Lemma\3\the complement of any sparse 
set E contains exactly one infinite connected component C; the complement of C has Besicovitch density 
OCLak/pkf. 

Proof. Let Jk = '^oCk- (The choice of Uk and pk will be discussed later.) For every k and for every rank k 
island fix a point in this island and consider the -neighborhood of this point. It is a square containing the 
entire island plus an additional "security zone" of width at, contained in the //^-neighborhood of the island, 
see Fig.fTOl 



Yk , Yk 



Figure 10: A point in a rank k island, its 7<:-neighborhood, and the security zone of width ak- 
It is enough to prove the following three statements: 

• The union U of all these squares (for all ranks) contains the set E and has Besicovitch density 

• The complement of U is connected. 

• There are no other infinite connected component in the complements ofE. 
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The first statement is a direct corollary of Lemma[5] above. 

To prove tlie second statement, consider two points x and y outside U . We need to prove that x and y 
can be connected by a path that is entirely outside U . Let us connect x and y by some path (say, one of the 
shortest paths) and then push this path out of U . Consider squares of maximal rank that intersect this path. 
For each of them, consider the first moment when the path gets into the square and the last moment when 
the path goes out, and connect these two points by a path outside the square, see Fig.fTTl 



Now the new path is ajt-separated from this island of rank k. Provided pk — 7k> GCk, the path after the 
correction is ai:-separated from all other islands of rank greater than ^ — 1. Indeed, the "modified" part of 
the corrected path (the points of the path involved into the correction procedure) remains at a distance of at 
most Yif- from the given k level island; hence, it must remain at a distance at least from all other islands 
of rank k and higher. Note also that the shift (the distance between the original path and the corrected one) 
does not exceed 2^. 

Then, we can do the same for islands of rank k — I (pushing the path out of surrounding squares). Note 
that since at each step the drift is bounded by Ijk-i, we will not bump into islands of rank k. 

Repeating this process for decreasing k, we finally get a path that connects x and y and goes entirely 
outside U. For this we need only the total drift on the smaller levels (which is bounded by 2^,<^ jj) to be 
less than a^.. This is easy to achieve if ak, fik^ and are suitable geometric sequences. 

It remains to show that every infinite connected set intersects the complement ofU. To show this, let us 
take a big circular path centered at the origin and then push it out of U as described above. Since the center 
is outside the jS^-neighborhoods of islands for large enough k, we may assume that the sizes of islands that 
intersect this circle are small compared to its radius (say, less than 1% of it, which can be guaranteed if the 
geometric sequences at, pk, and grow fast enough). Then, after the change the circle will still encircle a 
large neighborhood of the origin, so any infinite connected component should cross such a circle. □ 

9.4 Bi-islands of errors 

In the proof of our main result (Section [T3] ) we need a more delicate version of the definition of islands. 
In fact we need such a definition that some counterpart of Lemma |3] could be applied even if the sequence 
logjS„ grows much faster than 2" (e.g., for j8„ = c'^-^^"). In this section we define bi-islands (a generalization 
of the notion of islands from Section 1931) and prove bi-island versions of Lemmas |3l|4l and [5] The reader 
can safely skip this section for now and return here before reading Section [T3l 

Let £■ C be a set of points. As in Section l9!2l we call points in E dirty, and the other points clean. 
Let j8 > a > be integers. A nonempty set X C £ is an {a, p) -bi-island in £ if X can be represented as the 
union of some sets Xq, Xi such that 

(1) in the /3 -neighborhood of X = UXi there are no points from E\X; 




Figure 1 1 : Pushing a path out of the square. 
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(2) the diameters of Xq and Xi do not exceed a; and 

(3) the distance between Xq and Xi does not exceed j3. 

(See Fig.[T2j) In particular, an (a, j8)-island is a special case of an (a, j8)-bi-island (by letting Xi be empty). 




Figure 12: A bi-island, a union of two "islands" that are close to each other. 
Note that one may split the same bi-island into Xq and Xi in different ways. 

Obviously, every two different bi-islands are disjoint. Moreover, the distance between them is greater 
than j3. The diameter of a bi-island is at most (2a + j8). 

Let («! , j8i), {a2,p2),- • . be a sequence of pairs of integers and a,- < j3,- for all /. We define an iterative 
cleaning procedure for bi-islands. At the first step we find all (ai,j8i) -bi-islands and remove all their ele- 
ments from E (getting a smaller set E[). Then, we find in Ei all (a2,jS2)-bi-islands; removing them, we get 
E2 C El, etc. The cleaning process is successful if every dirty point is removed at some stage. 

Similarly to the case of islands, we say that a point x ^ is affected during step / if x belongs to the 
j8, -neighborhood of one of the bi-islands of rank /. 

The set E is called bi-sparse (for a given sequence o;,-,j3,) if the cleaning process defined above is suc- 
cessful, and, moreover, every point x ^I? is affected at finitely many steps only (which means that x is far 
from bi-islands of sufficiently large ranks). 

We choose the values of a, and j8, in such a way that for sufficiently small £ > a Bg -random set is 
bi-sparse with probability 1 . The main achievement here is that the convergence condition is now weaker 
than in the corresponding statement for islands (LemmaO: 

Lemma 6. Assume that 

12 52 < a„ < j3„ for every n, and — < °°- 

k<n i 

Then, for all sufficiently small £ > 0, a Bg-random set is bi-sparse with probability 1. 

Proof. The proof of Lemma[6]is very similar to the proof of Lemma[3] At first we estimate the probability 
of the event "x is not cleaned after n steps" for a given point x. If x ^ £"„, then x belongs to £",,-1 and is 
not cleaned during the nth step (when (a„, j8„)-bi-islands in £",,-1 are removed). Then, x G £",,-1. Moreover, 
we show that there exist two other points xi,X2 G £,1-1 such that the three distances d{x,xi), d{x,X2), and 
d{xi,X2) are all greater than a„/2 but not greater than 2j3„ + 2{a„/2) < 3j3„. 
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LetXo be the (a„/2) -neighborhood of x in E. If Xq were an island, it would be removed. Since this does 
not occur, there is a point xi outside Xq but in the j3„ -neighborhood of Xq. 

Let X\ be the (a„/2) -neighborhood of x\ in E. Again Xq and X\ do not form a bi-island. Both sets Xq 
and Xi have diameter at most a„, and the distance between them is at most j8„. So the only reason why they 
are not a bi-island is that there exists a point X2 ^ E outside Xq UXi but in the j8„ -neighborhood of it. The 
points xi and xj have the required properties (the distances d(x,xi), d(x,X2), and d(xi,X2) are greater than 
a,j/2 but not greater than 3j8„). 

To make the notation uniform, we denote x by xq. Each of the points xo,xi,X2 belongs to En-\. This 
means that each of them belongs to E„^2 together with a pair of other points (at a distance greater than 
(Xn-\/2 but not exceeding 3j8„_i). In this way we get a ternary tree that "explains" why x belongs to £„. 

The distance between every two points among xq, x\, and X2 in this tree is at least a„/2 whereas the 
diameters of the subtrees starting at xq, x\, and X2 do not exceed Y^i^n^^i- Thus, the lemma's assumption 
guarantees that these subtrees cannot intersect and that all the leaves of the tree are different. The number 
of leaves in this ternary tree is 3", and they all belong XaE = Eq. Every point appears in E independently of 
other points; hence, one such "explanation tree" is valid with probability £^". It remains to count the number 
of all explanation trees for a given point x. 

To specify x\ and X2 we need to specify horizontal and vertical distances between xq and x\^X2- These 
distances do not exceed 3j3„; therefore we need about 41og(6j3„) bits to specify them (including the sign 
bits). Then, we need to specify the distances between x^) and xoi,a;o2 as well as the distances between jcio 
and x\\,x\2 and between ;c20 and X2\,X2i. This requires at most 121og(6j8„_i) bits. To specify the entire tree 
we therefore need 

41og(6j8„) + 121og(6A,-i) + 361og(6j8„_2) + • • • +4 • 3"-' log(6i3i), 

which is equal to 4 • 3"^^(log(6j8i) + log(6j82)/3 H ). The series l^logj8„/3" converges by assumption; 

so, the total number of explanation trees for a given point (and given n) does not exceed 2'^'^"'. Hence, the 
probability for a given point x to be in £"„ for a Bg-random E does not exceed £^"2'^(^"\ which tends to 
as « — )■ oo (assuming that e is small enough). 

We conclude that the event "jc is not cleaned" (for a given point x) has zero probability; hence, with 
probability 1 all points in 1? are cleaned. 

It remains to show that every point with probability 1 is affected by finitely many steps only. Indeed, if x 
is affected by step n, then some point in its j8„ -neighborhood belongs to £„, and the probabiUty of this event 
is at most 

0(^2^£3"20(3") ^ 221°gA,+0(3")-log(l/e)3"_ 

From the convergence conditions we have logj8„ = o(3"), so the first term is negligible compared to others. 
The probability series converges (for small enough £) and the Borel-Cantelli lemma gives the result. □ 

By definition, a bi-sparse set is split into a union of bi-islands of different ranks. Such bi-islands occupy 
only a small part of the plane: 

Lemma 7. Let E he a bi-sparse set for a given family of and fit- Then, the Besicovitch density of E is 

oinock/Pk?). 

Proof. The proof of Lemma |7]repeats the proofs of Lemma |4l □ 

Recalling Lemma|5] we may consider a sequence of numbers such that yk>(Xk- Then, the Besicovitch 
density of the union of /^.-neighborhoods of rank k bi-islands (for all k and for all islands) is bounded by 
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However, this statement is not enougli for us. In Section [13] we will need a kind of "closure" of the 
//.-neighborhood of a bi-island: 

Definition. Let S be a k-level bi-island. We say that {x,y) G belongs to the extended y-neighborhood of 
S if there exist two points (x,y'),{x,y") G Z? {with the same first coordinate) such that dist(5', (x,/)) < 7, 
dist(5, {x,y")) < 7, andy' <y<y" (see Fig. \T3^. 

Figure 13: An extended neighborhood of a bi-island consists of the neighborhoods of its two parts and a 
zone between them. 

The meaning of the last definition is quite simple: We take not only the points that are close to S but also 
those points that are placed somehow between the neighborhoods of So and Si . 

Lemma 8. Let E be a bi-sparse set for a given family of ak and pk satisfying the conditions ofLemma^ 
Let Yk be a sequence of numbers such that au < Jh cmd the series Y,iYk/Pk) converges. Then, the Besicovitch 
density of the union of extended yk-neighborhoods of rank k bi-islands in E is bounded by 0{Y,iyk/ Pk))- 

Proof. The arguments are similar to the proof of Lemma |5l An extended 7^-neighborhood of a k-level 
island can be covered by a rectangle of width 0{yk) and height 0{l5k + yk)', so its area is 0{ykpk) (since 
Tic < jSjt). The distance between any two bi-islands of rank k is at least p^. Hence, the fraction of extended 
)4-neighborhoods of islands is yj^/ j8,t) (this is similar to the bound 0(L(7«:/j8/t)^)> which holds for simple 
7, -neighborhoods). □ 

Lemmas [6]-[8] will be used in Section [T3l (The arguments of Sections [T0] - [T2] do not refer to bi-islands.) 
These lemmas will be used for cCk,l5k such that loga^ ~ for q > 2, ^ OLk+\, and y^ = 0{ak) or y^ = 
0{al). Note that we cannot apply Lemmas [3] and |4] (about islands) for these parameters because logjS^; 
grows faster than 2^. So we need to deal with bi-islands. 

In the definition of sparse sets in Section l9!2l each single island of rank k must be isolated from other 
islands of rank k. In this section we modified this definition and allowed an island to be close to at most 
one other island of the same rank. In a similar way, we could define ^-islands for any s>2, assuming that 
clusters of s islands of rank k (rather close to each other) are authorized. A set that can be represented as a 
union of s-islands of different ranks can be called i^-sparse. A generalization of Lemmas[6]can be proven: A 
random set is ^'-sparse with probability 1 if L (log A' )/(■*+ 1)' converges. However, we do not develop here 
the general theory of ^-sparse sets. The concept of bi-islands and bi-sparsity (i.e., the case s = 2)h enough 
for all our applications in Section [T3l 
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10 Robust tile sets 

In this section we construct an aperiodic tile set where isolated defects can be healed. 




Figure 14: Patching holes. 

Let ci < C2 be positive integers. We say that a tile set T is {c\,C2)-robust if the following holds: For 
every A and for every T-tiling U of the (c2 A) -neighborhood of a square A x A excluding the square itself there 
exists a tiling V of the entire (c2 A) -neighborhood of the square (including the square itself) that coincides 
with U outside of the (ci A) -neighborhood of the square (see Fig.[T4l). 

Theorem 12. There exists a self-similar tile set that is (ci , C2) -robust for some ci and C2- 

Proof For every tile set jj. it is easy to construct a "robustified" version /i' of /i, i.e., a tile set /i' and 
a mapping d : }x' ^ such that (a) 5-images of /^'-tilings are exactly /i -tilings and (b) }x' is "5-robust": 
Every /^'-tiling of a 5 x 5 square minus 3x3 hole (see Fig.[T5]) can be uniquely extended to the tiling of the 
entire 5x5 square. 



Figure 15: Filling a 3 x 3 hole. 

Indeed, it is enough to keep in one ^u'-tile the information about the 5x5 square in /i -tiling. Matching 
rules will guarantee that the information about the intersection (4x5 rectangle) is consistent in neighbor 
tiles. Then, a 3 x 3 hole (as shown in the picture) is not fatal. It is easy to see that the tiles at its border (gray) 
are consistent and contain all the information the missing tiles should have. (In fact, using more careful 
estimates one can replace in our argument the 5x5 squares by 4 x 4 squares; but we do not care much about 
constants.) 

This robustification can be easily combined with the fixed-point construction. In this way we can get a 
"5-robust" self-similar tile set T if the zoom factor N (which is considered to be fixed in this argument) is 
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large enough. It remains to explain that "5-robustness" (in the sense described above) implies also (ci,C2)- 
robust for some ci and C2- (The values of ci and C2 depend on N, but N is fixed.) 

Indeed, assume that a tiling of a large enough neighborhood around a A x A hole is given. Denote by k the 
minimal integer such that N'^ > A (so the ^-level macro-tiles are greater than the hole under consideration). 
Note that the size of the k-\evel macro-tiles is linear in 0(A) since N'^ < N -A. 

In the tiling around the hole, an N xN block structure is correct except for the A/^-neighborhood of the 
central A x A hole. Indeed, the colors encode coordinates, so in every connected tiled region the coordinates 
are consistent. For similar reasons an x structure is correct except for the (A/^+A/^^) -neighborhood of 
the hole, etc. Hence, for the chosen k we get a fc-level structure that is correct except for (at most) 9 = 3x3 
squares of level k, so we can delete everything in these squares and use 5-robustness to replace them with 
macro-tiles that correspond to replacement tiles. 

To start this procedure (and fill the hole), we need a correct tiling only in the 0{N'^) neighborhood of the 
hole. (Technically, we need to have a correct tiling in the (3A^'^) -neighborhood of the hole; as 3A'^'^ < 3A^A, 
we let C2 = 3A'^.) The correction procedure involves changes in another 0(A'^'^) -neighborhood of the hole. 
(Technically, the changes touch (2A'^'^)-neig hborhood of the hole; 2N'' < 2NA, so we let ci = 2N.) □ 

11 Robust tile sets with variable zoom factors 

The construction from the previous section works only for self-similar tilings with a fixed zoom factor. It 
is enough for simple applications, as we will see in Section [121 However, in the proof of our main result 
in Section [13] we need a variable zoom factor. So here we develop a technique suitable for this case. This 
section can be skipped now but it should be read before Section [T3l 

Now we explain how to get "robust" fixed-point tilings with variable zoom factors N\,N2, As well 

as in the case of a fixed zoom factor, the idea is that ^-level macro-tiles are "responsible" for healing holes 
of size comparable with these macro-tiles. 

Let Ao < Ai < A2 < . . . be a sequence of integers. Let ci < C2 be positive integers. We say that a tile 
set T is {c I, C2) -robust against holes of size Ao,Ai,... if the following holds: For every n and for every 
T-tiling U of the C2Ak neighborhood of a square A/t x A^; excluding the square itself there exists a tiling V 
of the entire C2A/; neighborhood of the square (including the square itself) that coincides with U outside of 
the c\Ak neighborhood of the square. The difference from the definition of Section [TOjis that we take only 
values A G {Aq, Ai , . . .} instead of holes of arbitrary size. 

Lemma 9. Assume a sequence of zoom factors Nk grows not too fast and not too slow {it is enough to 
assume that Nk > Clog k and C\ogNk+\ < Nkfor a large enough C; cf. the discussion in Section O. Then, 
there exists a tile set with variable zoom factors Nt {k-level macro-tiles of size = Nq - ■ -Nk-i) that is 
{c I, C2) -robust (for some c\ and C2) against holes of size Lq,Li,. . .. 

Proof. First, we apply the fixed-point construction from Section [5] and get a tile set that is "self-similar" 
with variable zoom factors Ni,N2,--- Denote by the family of fc-level macro-tiles corresponding to this 
tile set. 

Further we make a "robustified" version of this tile set. To this end we basically repeat the arguments 
from Section [To] (the proof of Theorem [T2l). The difference in the argument is that now we deal with variable 
zoom factors, and sizes of holes are taken from the sequence Lo,Li , 

Denote by ju( the family of ^-level macro-tiles for the new tiling. We need that there exists a mapping 
5 : n'l^^ such that (a) 5-images of /i(-tilings are exactly /iyt -tilings and (b) ^u^ is "5-robust": Every /i^- 
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tiling of a 5 X 5 square minus a 3 x 3 hole (see again Fig. [TSl) can be uniquely extended to the tiling of the 
entire 5x5 square. 

To get such a robustification, it is enough to keep in every /i[-macro-tile the information about the 
5x5 square in the /Xjt-tiling and use the colors on the borders to ensure that this information is coherent in 
neighbor macro-tiles. 

As usual, this robustification can be combined with the fixed-point construction. We get 5-robust macro- 
tiles for all levels of our construction. "Self-similarity" guarantees that the same property holds for macro- 
tiles of all levels, which implies the required property of generalized robustness. 

Indeed, assume that a tiling of a large enough neighborhood around a A x A hole is given, and A<Lk for 
some k. In the tiling around the hole, an (Li x Li) block structure, is correct except for only the Li neighbor- 
hood of the hole. For similar reasons an (L2 x L2) structure is correct except for the {L1+L2) neighborhood, 
etc. So we get a ^-level structure that is correct except for (at most) 9 = 3x3 squares of size x L^. 
Because of 5-robustness, this hole can be filled with fc-level macro-tiles. Note that reconstruction of ground- 
level tiles inside a high-level macro-tile is unique after we know its "conscious known" information, i.e., 
the content of the tape of the Turing machine simulated on the computation zone of this macro-tile. (This 
information is reconstructed from the consciously known information of the neighbor macro-tiles.) [For the 
maximal complexity tile set (Section [Til it is not the case, and the absence of this property will become a 
problem in Section [13] where we robustify it. To solve this problem, we will need to use error-correcting 
codes.] 

To implement the patching procedure (and fill the hole) we need to have a correct tiling in the 0{Lic) 
neighborhood of the hole. The correction procedure involves changes in another 0{Lic) neighborhood of the 
hole. More technically, we need to have a correct tiling in the (3L|) -neighborhood of a hole of size L^, so 
we let C2 = 3. Since the correction procedure involves changes in the (2L/:) -neighborhood of the hole, we 
let ci = 2. □ 

We can robustify tiling not only against holes but against pairs of holes. To this end we slightly modify 
our definition of robustness. Let Aq < Ai < A2 < . . . be an increasing sequence of integers, and let ci < 
be positive integers. We say that a tile set z is {c 1,02) -robust against pairs of holes of size Aq, Ai , ... if the 
following holds: Let us have two sets H\,H2 C I?, each of them of diameter at most (for some k > 0). 
For every T-tiling U of the C2A1C neighborhood of the union {Hi U H2) excluding Hi and H2 themselves 
there exists a tiling V of the entire C2A/t neighborhood of {Hi UH2) (including Hi and H2 themselves) that 
coincides with U outside of the ciA^ neighborhood of {Hi U//2). 

A robustification against pairs of holes can be done in the same way as the robustification against a 
single isolated hole. Indeed, if these two holes are far apart from each other, we can "correct" them inde- 
pendently; if they are rather close to each other, we correct them as one hole of (roughly) doubled size. So 
we can employ the same robustification technique as before; we need only to take a large enough "radius of 
multiplication" D (and use D-robustness instead of 5-robustness). So we get the following generalization of 
Lemma |9) 

Lemma 10. Assume a sequence of zoom factors Nk grows not too fast and not too slow {e.g., Nk > Clog^ 
and ClogNjc+i < Nkfor a large enough C). Then, there exists a tile set with zoom factors Nk (i.e., with 
k-level macro-tiles of size Lk= Nq-- -Nk-i) that is {ci,C2)-robust (for some ci and C2) against pairs of holes 
of size Lq,Li , . . .for some ci and C2. 

Of course, similar propositions can be also proven for triplets, quadruplets, and any other sets of holes of 
bounded cardinality. However, in this paper we consider only pairs of holes; this is enough for our argument 
in Section [T3I 
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12 Strongly aperiodic robust tile sets 



Now we are ready to apply the islands technique to construct a robust strongly aperiodic tile set. We start 
with a formal definition of a tiling with errors (see the motivation and discussion in Section [9?T]) . 

Definition. For a subset £ C and a tile set T we call by a {T,E)-tiling any mapping 

T : {I?\E)^T 

such that for every two neighbor cells x^y G 1?\E, tiles T{x) and T(y) satisfy the tiling rules (colors on 
adjacent sides match). We may say that T is a t -tiling of the plane with holes at points of E. 

Theorem 13. There exists a tile set x with the following properties: (1) x-tilings of I? exist and {2) for all 
sufficiently small £ for almost every {with respect to B^) subset E C 1? every {x,E)-tiling is at least 1/10 
Besicovitch apart from every periodic mapping F : 1? ^ X. 

Remark 1. Since the tiling contains holes, we need to specify how we treat the holes when defining 
the Besicovitch distance. We do not count points in E as points where two mappings differ; this makes our 
statement stronger. 

Remark 2. The constant 1/10 is not optimal and can be replaced by any other constant a < 1. 

Proof. Consider a tile set T such that (a) all T-tilings are a-aperiodic for every a < 1/4 and (b) T is {c\,C2)- 
robust for some c\ and C2. Such a tile set can be constructed by combining the arguments used for Theo- 
rems[l2]and HI More precisely, we take as the "basic" construction the tile set from the proof of Theorem|4] 
(which simulates the Thue-Morse substitution). Then, we "robustify" it by the procedure from the proof of 
Theorem [121 For the robustified tile set we know that each macro-tile in a tiling keeps the conscious infor- 
mation that was given (in the "basic" tile set) to all macro-tiles in its 5 x 5 -neighborhood; so the new tiling 
is not only strongly aperiodic but also 5-robust. It remains to show that this construction implies claim (2) 
of the theorem. 

We want to apply our probabilistic lemmas concerning "island of errors". We need to choose au and 
such that 

• the conditions of Lemma|3](p.|28]) are satisfied, and therefore a random error set with probability 1 is 
sparse with respect to these au and ^k', 

• for every sparse set E C T?, every (T,£')-tiling can be iteratively corrected (by changing it in the 
neighborhoods of islands of all ranks) into a T-tiling of the entire plane; and 

• the Besicovitch distance between the tilings before and after correction is small. 

Then, we conclude that the original (T,£')-tiling is strongly aperiodic since the corrected tiling is strongly 
aperiodic and close to the original one. 

To implement this plan, we use the following lemma that describes the error-correction process. 

Lemma 11. Assume that a tile set T is {c\,C2)-robust, > 4c2CCkfor every k, and a set E C 'Z? is sparse 
(with parameters a^, Pk)- Then, every {z,E)-tiling can be transformed into a T-tiling of the entire plane by 
changing it in the union of {2c\(Xk)-neighborhoods of rank k islands (for all islands of all ranks). 
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Proof. Note that (j8<:/2) -neighborhoods of rank k islands are disjoint and large enough to perform the error 
correction of rank k islands, since > 4c2CCk- The definition of a sparse set guarantees also that every 
point is changed only finitely many times (so the limit tiling is well defined) and that the limit tiling has no 
errors. □ 

The Besicovitch density of the changed part of a tiling can be estimated using LemmalU Here ]^ = 2ci ak 
is proportional to a^, so the Besicovitch distance between the original and corrected tilings (in Lemma [TT]) 
is 0{Y,k{(^k/ Pk)'^)- (Note that the constant in O notation depends on ci.) 

It remains to choose Uk and pk- We have to satisfy all the inequalities in Lemmas [3l|4l and[TT] To 
satisfy Lemmas |4] and [IH we may let jik = ckUk for large enough c. To satisfy Lemma |3j we may let 

OCfc+i = 8(j8i H h pk) + 1- Then, ak and jS^ grow faster than any geometric sequence (like k\ multiplied by 

some exponent in k), but still log fik is bounded by a polynomial in k and the series in Lemma [3] converges. 

With these parameters (and taking c large enough) we guarantee that the Besicovitch distance between 
the original {r,E)-ti\mg and the corrected T-tiling does not exceed, say, 1/100. 

Now assume that some (T,£')-tiling V is at a distance less than 1/10 from some periodic configuration W 
(with a period v). As we just explained, the original (T,£')-tiling V must be at a distance at most 1/100 from 
some correct T-tiling V'. Let us consider the v-shift of both configurations V and V' (W is shifted to itself). 
It is easy to see that the distance between the initial and the shifted copies of configuration V' is not greater 
than the sum dist(y',y) +dist(V,W) taken twice. Since the corrected tiling V' must be 1/4-aperiodic, and 
1/4 > 2(1/10+1/100), we get a conti-adiction. □ 



13 Robust tile sets that enforce complex tilings 

In this section we prove the main result of the paper. We construct a tile set that guarantees large Kolmogorov 
complexity of every tiling and that is robust with respect to random errors. 

Theorem 14. There exists a tile set z and constants ci,C2 > with the following properties: 

(1) a T-tiling ofl? exists; 

(2) for every T-tiling T of the plane, every N x N square of T has Kolmogorov complexity at least 
C1N-C2; 

(3) for all sufficiently small £ for almost every {with respect to the Bernoulli distribution Be) subset 
E C Z^, every {z,E)-tiling is at most 1/10 Besicovitch apart from some T-tiling of the entire plane 1?; 

(4) for all sufficiently small £ for almost every B^-random subset E C Z?, for every {r ,E)-tiling T the 
Kolmogorov complexity of centered squares ofT of size N x N is Q.(N). 

The rest of the section is devoted to the proof of this theorem. It combines almost all technique de- 
veloped in this paper: self-similar tile sets with variable zoom factors, embedding a sequence with Levin's 
property (i.e., with linear Kolmogorov complexity of all factors) into tiUngs, bi-sparse sets, incremental error 
correcting, and robustness against doubled holes. 

In this section the basic idea of incremental error correcting is applied in a slightly modified form. 
Here we cannot apply directly the technique of (ci,C2)-robustness from Section [TOl Instead we use the 
idea of robustness against holes of some sequence of sizes Aq, Ai, A2, . . ., as explained in Section [TT] More 
precisely, we do it as follows: We split the set of random errors into bi-islands of different ranks. Then, 
we eliminate them one by one, starting from lower ranks. When we correct an isolated bi-island of rank k, 
we need a precondition (similarly to the argument in Section [TOll: In a large enough neighborhood of this 
bi-island there are no other errors. Elimination of a ^-level bi-island involves corrections in its extended 
O (Ait) -neighborhood (with all parameters as specified below). 
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13.1 The main difficulties and ways to circumvent them 

We want to combine the construction from Section |7] with error-correcting methods based on the idea of 
"islands" of errors. There are two main difficulties in this plan: fast growing zoom factors and gaps in 
vertical columns. Let us discuss these two problems in some detail. 

The first problem is that our construction of tiling with high Kolmogorov complexity from Section |7] 
requires variable zoom factors. What is even worse is that zoom factors Nk must increase very fast (with 
logarithms growing faster than 2*^). Hence, we cannot directly apply the technique of islands from Sec- 
tion 19.21 since it works only when ^ ^^^p^ < °° (where is the parameter from the definition of islands, 
which, in our construction, must be of the same order as the size of k-\evel macro-tiles). To overcome this 
obstacle, we replace islands by bi-islands (the technique developed in Section [94] ). 

The second problem is that now we cannot reconstruct a macro-tile from the information "consciously 
known" to this macro-tile. The missing information is the sequence of bits assigned to the vertical columns 
(with each vertical column of tiles carrying one bit of a high-complexity sequence CO). Random errors make 
gaps in vertical columns, so now the columns are split into parts, which a priori can carry different bits. To 
overcome this problem we organize additional information flows between macro-tiles to guarantee that each 
infinite vertical column carries in most of its tiles one and the same bit value. 

13.2 General scheme 

Here we explain the general ideas of our proof. First, we use macro-tiles with variable zoom factors Nt = 
g[2.5 J ^ large enough integer Q> 0. This means that every fc-level macro-tile is an {N^-i x Nk-i) array 
of {k — l)-level macro-tiles. So the size (the number of columns and the number of rows) of a k-\ewe\ macro- 
tile is = No ■■ -Nk-i, and < Nk- (The constant 2.5 in our construction can be replaced by any rational 
number between 2 and 3.) 

To get tilings with high Kolmogorov complexity, we reuse the construction from Section |7] with the 
zoom factors defined above. Let us recall the idea of that construction (proof of Theorem |9l). In a correct 
tiling, in the ith column all tiles keep some bit ft),, and we want every N-hit substring in the corresponding 
biinfinite sequence CO to have Kolmogorov complexity Q.{N). To enforce this property we organize our 
computation on macro-tiles of all levels. The crucial point of the construction is propagation of bits ft), to the 
computation zones of macro-tiles of high levels. Let us recall the main points of this construction (following 
the argument from Section |7]l: 

• We say that for each (infinite) column of tiles in a tiling there is an assigned bit w,, which is "known" 
to each tile in the column. (In other words, there is a mapping that attributes to each tile the corre- 
sponding bit ft),-; vertically neighboring tiles must keep the same value of the bit.) 

• For a ^-level macro-tile (of size x L^) its zone of responsibility is the sequence of bits cOi assigned 
to all columns of this macro-tile. Vertically aligned macro-tiles of the same level have the same zone 
of responsibility. 

• For some k-level macro-tile M there is one delegated bit; this is a bit ft),- from the zone of responsibihty 
of this macro-tile. This bit must be known to the "consciousness" of the macro-tile; that is, it must be 
presented explicitly on the tape in the computation zone of this macro-tile. For technical reasons, we 
decide that the position of the delegated bit ft), in the zone of responsibility of M (this position is an 
integer between and L^^ — 1) is equal to the position (vertical coordinate) of M in its father macro-tile 
(see Fig. H). The father is a macro-tile of level k + l, which consists of x Nt macro-tiles of level 
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k (thus, the vertical coordinate of a ^-level macro-tile in its father ranges over 0,. . . ,Nk — V)- In our 
settings, Nk > Lk-\. If a /c-level macro-tile M has a vertical coordinate in its father greater than Nu, 
then M does not have a delegated bit. 

• If a /c-level macro-tile M has a delegated bit in its computation zone, it also contains a group of bits 
to check that starts at the delegated bit and has rather small length (say, log log log If this group of 
bits leaves the responsibility zone, we truncate it. The Turing machine simulated in the computation 
zone of M enumerates the forbidden strings of "too small Kolmogorov complexity" and verifies that 
the checked group of bits does not contain any of them. This process is bounded by time and space 
allocated to the computation zone of a ^-level macro-tile. 

The last item requires additional comments. Technically, we fix constants a € (0, 1) and c and check that 
for every string x in zones of responsibility of all macro-tiles K{x) > a\x\ —c. To check this property, a 
macro-tile enumerates all strings x of complexity less than a\x\ — c. This enumeration requires infinite time, 
though computations in each macro-tile are time-bounded. However, this is not a problem since every such 
X is checked in macro-tiles of arbitrarily high levels (i.e., if x is covered by a macro-tile of level k, then it is 
also covered by macro-tiles of all levels greater than k). Thus, we guarantee the following property: 

For every k-level macro-tile M {k= 1,2, . . .), and for every substring xofco 

that is contained in M's zone of responsibility {its horizontal projection), it (*) 

holds that K{x) '>Ol,\x\— c. 

Notice that K{x) > a\x\ — c holds only for strings x covered by some macro-tile (i.e., strings that belong to 
some macro-tile's zone of responsibility). In "degenerate" tilings there can exist an infinite vertical line that 
is a border line for macro-tiles of all levels (see Fig.|6ll. A string x that intersects this line is not covered by 
any macro-tile of any level. Hence, (*) does not guarantee for such a string x that its Kolmogorov complexity 
is greater than a\x\ — c. However, as we noticed in Section 1741 the parts of x on both sides of the boundary 
are covered by some macro-tile. Hence, it follows from (*) that K{x) > -f |;c| — 0(1) = for all factors 

X of the biinfinite string ft). 

Thus, we reuse the argument from Section |2l and it works well if there are no errors, but when we 
introduce random errors, the old construction is broken. Indeed, vertical columns can be damaged by islands 
of errors. Now we need to make an effort to enforce that copies of ft),- consciously kept by different macro- 
tiles are coherent (at least for macro-tiles that are not seriously damaged by local errors). To this end we will 
use some checksums, which guarantee that neighbor macro-tiles keep coherent conscious and subconscious 
information. We discuss this topic in the next section. 

To deal with random errors we use the technique of bi-islands (see Section |94l ). Our arguments work if 
diameters of k-level bi-islands are comparable with the size of ^-level macro-tiles. Technically, we set at = 
26L/t-i and jS^ = 2Li. (In the following we will see that this choice of is important for the error-correcting 
procedure; at is set to I3pk-i^ so that lemmas on bi-islands can be applied.) Recall that = 2^2-5 J and 
Lk =No-- -Nk-i- Note that Lemmas [6] and |7] can be used with these values of parameters a, jS. We will also 
employ Lemma [8] with Yk = 0{ak)- 

13.3 The new construction of the tile set 

We take the construction from Section|7]as the starting point and superimpose some new structures on ^-level 
macro-tiles. We introduce these supplementary structures in several steps. 
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First step (introducing checksums): Every ^-level macro-tile M (in a correct tiling) consists of an 
Nk-i X Nk-\ array of {k — l)-level macro-tiles; each of these {k — l)-level macro-tiles may keep one del- 
egated bit. Let us take one horizontal row (bits assigned to A'^fc-i macro-tiles of level k — \) in this two- 
dimensional array of size Nk-\ xNk-\. Denote the corresponding sequence of bits by rji, . . . ,7]^^ ^. We 
introduce a sort of erasure code for this string of bits. In other words, we will calculate some checksums 
for this sequence. These checksums should be suitable to reconstruct all bits T]i, . . . , if at most D of 
these bits are erased (i.e., if we know values Tj, for only Nk-\ — D positions); here D > is a constant (to 
be fixed later). We want the checksums to be easily computable. Here we use again the checksums of the 
Reed-Solomon code (discussed in Section |6]). 

Let us explain this technique in more detail. We take a finite field F^: of large enough size (greater than 
Nk-i +D). Then, we calculate a polynomial of degree less than A'yt-i that takes values Tji , . . . , T]Arj_, at some 
Nk- 1 points of the field. Further, we take as checksums the values of this polynomial at some other D points 
from (where all (A'^t-i +D) points of the field are fixed in advance). Two polynomials of degree less than 
Nk-i can coincide in at most (A'yt-i — 1) points. Hence, if D bits from the sequence T]i , . . . , T]iVj_, are erased, 
we can reconstruct them given the other (nonerased) bits rjj and the checksums defined above. 

These checksums contain C?(logA'^i._i) bits of information. We next discuss how to compute them. 

Second step (calculating checksums): First, we explain how to compute the checksums, going from 
left to right along the sequence Tji , . . . , T]A?t_, . This can be done in a rather standard way as follows. 

Let T]i , . . . , ^Nk-i be the values of a polynomial p{x) (of degree less than A'^fc-i) at points xi , . . . ,xyvj_i . 
Assume we want to reconstruct all coefficients of this polynomial. We can do this by the following iter- 
ative procedure. For / = 1, . . . ,Nk-i we calculate polynomials pi{x) and qi{x) (of degree < (/ — 1) and /, 
respectively) such that 

Piixj) = r]j for j= 1,...,/ 

and 

qi{x) = {x-xi)---{x-Xi). 

It is easy to see that for each /, polynomials and (7,+i can be computed from polynomials p,- and qi and 
the values x;+i and T],_|_i. 

If we do not need to know the resulting polynomial p = PNt-i {^) but want to get only the value p{a) at 
some particular point a, then we can perform all these calculations modulo {x — a). Thus, to obtain the value 
of p{x) at D different points, we run in parallel D copies of this process. At each step of the computation we 
need to keep in memory only 0(1) elements of F^, which is C?(logA'yt-i) bits of temporary data (with the 
multiplicative constant in this 0{-) notation depending on the value of D). 

This calculation can be simulated by a tiling. We embed the procedure just explained into the com- 
putation zones of {k— 1) -level macro-tiles. The partial results of the calculation are transferred from one 
{k — 1) -level macro-tile to another one, from the left to the right (in each row of length A'^t-i in a /c-level 
macro-tile). The final result (for each row) is embedded into the conscious information (bits on the tape of 
the Turing machine in the computation zone) of the rightmost {k — 1) -level macro-tile of the row. 

To organize these computations, we need to include into conscious information kept by {k — l)-level 
macro-tiles additional C?(logA'jt_i) bits and add the same number of bits to their macro-colors. This fits well 
our fixed-point construction since zoom factors Nk grow fast, and we have enough room in the computation 
zone. 

Third step (consistency of checksums between macro-tiles): So far, every k-\&ve\ macro-tile contains 
0{Nk-\logNk-\) bits of checksums and C?(logA'^fc_i) bits for every row. We want these checksums to be 
the same for every two vertical neighbor macro-tiles. It is inconvenient to keep the checksums for all rows 
only in the rightmost column (since it would create too much traffic in this column if we try to transmit the 
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checksums to the neighbor macro-tiles of level k). So we propagate the checksums of the ith row in a ^-level 
macro-tile M {i= 1, . . . ,Nk-i) along the entire ith row and along the entire ith column of M. In other words, 
these checksums must be "consciously" known to all {k — l)-level macro-tiles in the ith row and in the ith 
column of M. In Fig. [16] we show the area of propagation of checksums for two rows (the ith and the jth 
rows). 




Figure 16: Propagation of checksums inside of a macro-tile. 

On the border of two neighbor ^-level macro-tiles (one above another) we check that in each column 
/ = l,...,Nk-\ all the corresponding checksums computed in both macro-tiles coincide. This check is 
redundant if there are no errors in the tiling: The checksums are computed from the delegated bits (which 
come from the sequence of bits (O encoded into tiles of the ground level), so the corresponding values for 
all vertically aligned macro-tiles must be equal to each other. However, this redundancy is useful to resist 
errors, as we show in the following. 

Fourth step (robustification): The features just explained organized in every ^-level macro-tile (bit 
delegation, computing and propagating checksums, and all the computations simulated in the computation 
zone of a macro-tile) are simulated by means of bits kept in the "consciousness" (i.e., in the computation 
zone) of {k — l)-level macro-tiles. Now we fix some constant C and "robustify" this construction in the 
following sense: Each {k — l)-level macro-tile M keeps in its consciousness not only "its own" data but 
also the bits previously assigned to {k — l)-level macro-tiles from its (C • L/:_i) -neighborhood (i.e., the 
(2C + 1) X (2C + 1) array of {k — l)-level macro-tiles centered at M). So, the content of the consciousness 
of each macro-tile is multiplied by some constant factor. Neighbor macro-tiles check that the data in their 
consciousness are coherent. 

We choose the constant C so that every ^-level bi-island (which consist of two parts of size ak) and 
even the = 0(o;<;) -neighborhood of every ^-level bi-island (where we specify ji^ below) can involve only 
a small part of the (CLj:_i) -neighborhood of any {k — 1) -level macro-tile. (Note that here we talk about 
neighborhoods, not about extended neighborhoods of bi-islands defined in Section [9!4l ) 

This robustification allows us to reconstruct the conscious information of a ^-level macro-tile and of its 
{k — 1) -level sons when this macro-tile is damaged by one k-\evel bi-island (assuming there are no other 
errors). 

The last remark (the number of bits in the consciousness of a macro- tile): The construction ex- 
plained above requires that we put into the computation zones of all {k — 1) -level macro-tiles additional 
poly (log A/^t-i) bits of data. (The most substantial part of the data is the information used to compute the 
checksums.) Again, this fits our fixed-point construction because poly (log A/jt-i) is much less than Nk-2, so 
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we have enough room to keep and process all these data. 

The tile set T is thus defined. Since there exists an co with Levin's property, it follows that T-tiling exists, 
and every N x N square of such a tiling has Kolmogorov complexity D.{N). Further, we prove that this T 
satisfies also statement (3) of Theorem [T4l 

13.4 Error-correcting procedure 

Denote by T the tile set described in Section [13.31 Let e > be small enough. Lemma [12] says that a Bg- 
random set with probability 1 is bi-sparse. Now we assume that £ C is a bi-sparse set (for the chosen 
values of a,- and j8,), and T is a T-tiling of 7?\E. Further, we explain how to correct errors and convert T 
into a tiling T' of the entire plane (where T' should be close to T). 

We follow the usual strategy. The set E is bi-sparse; that is, it can be represented as a union of isolated 
bi-islands of different ranks. We correct them one by one, starting from bi-islands of low ranks. To prove 
that the correction procedure converges, we need to explain one step of this process: how to correct one 
bi-island S of rank k assuming that it is well isolated, i.e., in the jSj.-neighborhood of this bi-island there are 
no other (still noncorrected) errors. 

Let us recall that a ^-level bi-island S is a union of two "clusters" ^o, ; the diameters of both So and S\ 
are at most ttyt = 0(L/._i). Hence the clusters and Si touch only 0(1) macro-tiles of level {k — 1). The 
distance between and Si is at most P^, and the j3i:-neighborhood of S is free of other bi-islands of rank k 
and higher (so we can assume that the jS^-neighborhood of S is already cleaned of errors). Our correction 
procedure around S will involve only points in the extended 7<:-neighborhood of S, where = lau- 

Let M be one of ^-level macro-tiles intersecting the extended -neighborhood of the ^-level bi-island S. 
Basically, we need to reconstruct all {k — 1) -level macro-tiles in M destroyed by S. First, we will reconstruct 
the conscious information in all {k — 1) -level macro-tiles in M. This is enough to get all bits of (O from the 
"zone of responsibility" of M. Then, we will reconstruct in a consistent way all ?i-level macro-tiles inside M 
for all n < k. 

Thus, we start with reconstructing the consciousness of all {k — 1) -level macro-tiles M' in M. First, 
we recall that the consciousness (the content of the computation zone) of every {k — 1) -level macro-tile M' 
consists of several groups of bits (cf. the outline of the construction in Section [T3.2[ p.[4TI): 

[A] the binary representation of the number — 1) and coordinates (integers from the range 0, . . . ,Nk-i — 
1) of M' in the father macro-tile M; 

[B] the bits used to simulate a Turing machine on the computation zone of M and the bits used to imple- 
ment "wires" of M; 

[C] the bit (from the sequence ft)) delegated to M'; 

[D] the bit (from ft)) delegated to M; 

[E] the bits used to calculate and communicate the checksums for the corresponding row of {k — 1) -level 
macro-tiles in M; and 

[F] a group of bits to check from the zone of responsibility of M'; these bits are checked by the macro-tile: 
M' checks on its computation zone that this "group of bits to check" does not contain any factor of 
low Kolmogorov complexity. 
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Bits of field [A] in a small isolated group of {k— 1) -level macro-tiles are trivially reconstructed from the 
surrounding macro-tiles of the same level. Fields [B], [C], [D], and [E] can be reconstructed because of the 
robustification on the level of {k — 1) -level macro-tiles. (We organized the robustification on the level of 
{k — 1) -level macro-tiles in such a way that we are able to reconstruct these fields for any C x C group of 
missing or corrupt {k — l)-level macro-tiles.) So far the correcting procedure follows the exactly the same 
steps as in Section [TT] 

To reconstruct fields [F] of {k — 1) -level macro-tiles in M, we need to reconstruct all bits of co from 
the zone of responsibility of M. We can extract these bits from the neighbor ^-level tiles above or below 
M. (Recall that bi-island S touches only 0(1) ^-level macro-tiles, and there is a "healthy" zone of ^-level 
macro-tiles around them.) However, a problem remains since we are not sure that the co bits above M, below 
M, and inside M are consistent. Now we show that this consistency is guaranteed by checksums. 

Denote by M„ and the /c-level macro-tiles just above and below S. Since the distance between S and 
other ^-level bi-islands is greater than jSj. = 2L^, we know that M„ and must be free of errors (where we 
assume that errors of ranks less than k are already corrected). See Fig.[T7j In what follows, our explanations 
refer to Fig. [T7J where bi-island 5 touches only one k-\eve\ macro-tile; if S touches several ^-level macro- 
tiles, substantially the same arguments work. It is enough to prove that the bits 0), assigned to corresponding 
columns of M„ and in are equal to each other. 



macro-tile M„ without errors 




macro-tile M with an error bi-island 



macro-tile Mj without errors 



Figure 17: Bi-island of errors in a macro-tile. 

The macro-tiles M„ and are error free; therefore, the sequences of bits cOi corresponding to the 
vertical lines intersecting these /c-level macro-tiles are well defined. Since there are no errors, the conscious 
information (including checksums) in all macro-tiles of all levels inside Mj, and is consistent with these 
bit sequences. So, the Lk bits assigned to the vertical columns are correctly delegated to the corresponding 
{k — 1 ) -level macro-tiles inside M„ and M^. However, it is not evident that the sequences of bits embedded 
in M„ and are equal to each other. 

In fact, it is easy to see that bit sequences for M„ and coincide with each other at most positions. 
They must be equal for all columns (from the range 0, . . . ,L<: — 1) that do not intersect bi-island S (i.e., in 
nondamaged columns of tiles on the ground level, the assigned bits ft), correctly spread though macro-tiles 
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Mu, M, and Mj). Hence, the bits delegated to the corresponding {k — 1)-Ievel macro-tiles in M„ and are 
equal to each other, except for only {k— l)-level macro-tiles in the "gray zone" of Fig.[T7J which contains the 
{k — l)-level macro-tiles involved in the correction of S and all vertical stripes touching the involved sites. 
(The width of this gray stripe is only 0(1) macro-tiles of level {k — 1).) Hence, for / = 0, . . . , [Nt-i — 1), 
in the ith rows of {k — 1) -level macro-tiles in M„ and M^, the sequences of delegated bits are equal to each 
other except possibly for only 0(1) bits (delegated to {k— l)-level macro-tiles in the "gray zone"). 

The robustness property guarantees that all checksums are correctly transmitted through M. Hence, 
checksums for corresponding rows in M„ and in must be equal to each other 

Thus, for every two corresponding rows of {k — 1 ) -level macro-tiles in M„ and in we know that (a) all 
except 0(1) delegated bits in the corresponding positions are equal to each other and (b) the checksums are 
equal to each other From the property of our erasure code it follows that in fact all delegated bits in these 
rows are equal to each other (with every ith bit in being equal to the ith bit in M^). Therefore, all bits ft), 
in Mu and are the same (on the ground level). We can use these bits to reconstruct subconsciousness of 
M and get a consistent tiling in M. 

We are almost done. Bi-island S is corrected; we reconstructed conscious information for the k-level 
macro-tile M and for all its {k — l)-level sons. Now we can reconstruct fields [F] in the damaged (^ — 1)- 
level macro-tiles inside M. This is simple to do. We just take the corresponding bits CO; from the zone of 
responsibility (shared by M, Mu, and M^). It remains only to explain why the checking procedure does not 
fail for these groups of bits (i.e., {k — l)-level macro-tiles do not discover in these bit strings any factors of 
low Kolmogorov complexity). But this is true because macro-tiles of levels — 1) (and also below — 1)) 
inside M apply exactly all the same checks to exactly the same groups of bits ft), as the macro-tiles in the 
corresponding positions in M„ and M^. Since there is no errors in M„ and Mj, these computations do not 
lead to a contradiction. 

Let us inspect again the correction procedure just explained; we should notice which tiles are involved in 
the error-correcting process around bi-island S. In the {k — l)-level macro-tiles outside the "gray zone" we 
change nothing. Moreover, not all the gray zone needs to be changed — only the part between two clusters 
of S (and their small neighborhoods) is affected. Indeed, in all tiles of M that are above 5 the assigned bits 
ft); are the same as in the corresponding columns of M„; in the tiles of M that are below 5 the assigned bits 
COi are the same as in the corresponding columns of M^. Hence, there is no need to correct "subconscious 
information" of {k — 1) -level macro-tiles that are above or below S. Only the area between two clusters of 
S requires corrections. More precisely, the area involved in the correcting procedure is inside the extended 
neighborhood of S. (In fact, this argument is the motivation of our definition of extended neighborhood.) 

Thus, we have proven that this step-by-step correcting procedure eliminates all bi-islands of errors and 
only extended 7i.-neighborhoods of A;-level bi-islands are involved in this process. Now Theorem [14] (part 3) 
follows from Lemma[8] It remains only to prove part 4 of the theorem. We do this in the next section. 

13.5 Levin's property for (o embedded into a (t, £)-tiling 

It remains to prove part (4) of Theorem [TH In the previous section we proved that if the set of errors E is 
bi-sparse, then a (T,£')-tiling T can be converted into a T-tiling T' of the entire plane, and the difference 
between T and T' is covered by extended Jk neighbors of ^-level bi-islands from £ (^ = 0, 1, . . .). Now we 
want to show that, in the initial tiling T, the Kolmogorov complexity of centered squares of size N xN was 

Fix a point O. Since E is bi-sparse, O is covered by j8i:-neighborhoods of only finitely many bi-islands. 
Hence, for large enough A, the A x A square Qa centered at O intersects extended y^; -neighborhoods of k- 
level bi-islands only if < ^- (If the extended 7<;-neighborhood of some bi-island intersects and fik > A, 
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then j8;t — 7<; > A/2 and O is covered by the j3yt -neighborhood of this bi-island.) Therefore, to reconstruct T' 
in 2a it is enough to correct there all bi-islands of bounded levels (such that j3yt < A). 

To reconstruct T' in 2a we need to know the original tiling T in 2a and some neighborhood around 
it (i.e., in some centered 0(A) x 0(A) square 2a'> which is only greater than 2a by a constant factor). 
Indeed, given the tiling T restricted on 2a'> we can locally correct there bi-islands of levels 1,2, ... ,^ (such 
that j3i: < A) one by one. Correcting a bi-island of errors in 2a' we obtain the same results as in the error- 
correcting procedure on the entire plane I? unless this bi-island is too close to the border of 2a' (and the 
local correction procedure should involve information outside 2a')- Thus, we can reconstruct T'-tiling not 
in the entire 2a' but in points that are far enough from the border of this square. If A' = cA for large enough 
c, then 2a' provides enough information to reconstruct T' in 2a- 

We know that Kolmogorov complexity of error-free tiling T' in 2a is n(A). Therefore, the Kolmogorov 
complexity of the original T-tiling in the greater square 2a' is also n(A). Since A' is only greater than A 
by a constant factor, we get that the Kolmogorov complexity of the (!,£■) -tiling T restricted to the centered 
(A' X A') square is n(A'). 

Theorem [14] is proven. 
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